
I'm tasked with searching for the use of cliches and common phrases in text. The phrases are similar to the phrases you might see for the phrase puzzles on Wheel of Fortune. Here are a few examples:

  • Easy Come Easy Go
  • Too Good To be True
  • Winning Isn't Everything

I cannot find a list of phrases however. Does anybody know of such a list?

Seriously, even a list of all Wheel of Fortune solutions would suffice.

dmcer
Marcus Adams
  • Thanks for the answers so far, but I need something more comprehensive. As a test, it should probably have all three of my examples. Thanks. – Marcus Adams Apr 19 '10 at 01:23

5 Answers


Here's an enormous list for you (A-K), L-Z on the next page:

http://www.learn-english-today.com/proverbs/proverbs.html

Also, Wikiquote has a bunch for you:

http://en.wikiquote.org/wiki/English_proverbs

If you want a list of clichés, look no further: http://clichesite.com/alpha_list.asp?which=lett+1

Both of these sites have the three you mentioned:

1) http://1001truisms.webs.com/truisms.htm

Vivek Aditya
atp

If you want to generate a list automatically:

Scrape Twitter and as many blogs as you can get RSS feeds for. Use them to build a corpus for a Markov chain and look for very high-probability parses. I suspect clichés will look normal at the start but have unusually low entropy near the end. That is, once you see "X Y Z", you can say with near certainty that the next words will be "A B C".
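
A rough sketch of the low-entropy idea, in Python. This is only one way to read the suggestion: it assumes a fixed-order word n-gram model, whitespace tokenization, and arbitrary thresholds (`min_count`, `max_entropy`, `max_len`), none of which come from the answer itself. The function names are made up for illustration, and collecting the corpus from Twitter or RSS feeds is left out.

```python
import math
from collections import Counter, defaultdict


def build_model(sentences, order=2):
    """Count which word follows each `order`-word context."""
    model = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for i in range(len(words) - order):
            context = tuple(words[i:i + order])
            model[context][words[i + order]] += 1
    return model


def entropy(counter):
    """Shannon entropy (in bits) of a next-word distribution."""
    total = sum(counter.values())
    return -sum((n / total) * math.log2(n / total) for n in counter.values())


def predictable_phrases(model, min_count=3, max_entropy=0.5, max_len=8):
    """Extend each frequent context while the next word stays near-certain.

    The thresholds are illustrative guesses and would need tuning
    against a real corpus.
    """
    phrases = set()
    for context, followers in model.items():
        if sum(followers.values()) < min_count:
            continue
        order = len(context)
        phrase = list(context)
        while len(phrase) < max_len:
            # .get() avoids adding new keys to the defaultdict mid-iteration.
            nxt = model.get(tuple(phrase[-order:]))
            if not nxt or sum(nxt.values()) < min_count:
                break
            if entropy(nxt) > max_entropy:
                break
            phrase.append(nxt.most_common(1)[0][0])
        # Keep only contexts that were extended by at least one word.
        if len(phrase) > order:
            phrases.add(" ".join(phrase))
    return phrases
```

Calling `predictable_phrases(build_model(sentences))` on a list of sentence strings returns the candidate phrases. On a real corpus the output would probably also include boilerplate such as signatures and retweet prefixes, so some filtering against a stop list would likely be needed.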

BCS

Cliché Site has a list of 2,100 idioms. The list looks pretty easy to harvest.

dmcer

I know an answer has been accepted, but that answer is dated. Currently, Wiktionary is the best place to go (~8000 entries): https://en.wiktionary.org/wiki/Category:English_idioms

RussellB.

You should be able to harvest a list from http://evilpettingzoo.com/wheel/ pretty easily.

Sebastian Paaske Tørholm