0
Usenet Posted 22 years ago
English in UK

Looking for a corpus of English...

Hello all,
I am doing a dissertation which requires me to use a wide corpus of English. I have looked at the BNC and the Bank of English but the costs are prohibitive as I am funding this myself.

Has anyone any idea where I might get a corpus of a few million words which I use for concordancing and word frequency-of-use analysis?
I would really appreciate any help or advice you guys could give me.
Thanks,
Howard.

Howard Coakley
e-mail... howard
  

Top answer

[nq:1]Hello all, I am doing a dissertation which requires me to use a wide corpus of English. I have looked ... use for concordancing and word frequency-of-use analysis?

  • [nq:1]Hello all, I am doing a dissertation which requires me to use a wide corpus of English.
  • I have looked ...
  • use for concordancing and word frequency-of-use analysis?
  • [/nq] There are masses of On Line Newspapers with Modern English, beware the Tabloids have different word frequencies from Broadsheets.
  • For pre 1923 works there is Project Gutenberg.
Free · every Monday

Get the Weekly English Kit 📬

New words, one handy idiom, and a 2-minute quiz — delivered to your inbox to keep your streak alive.

9 Answers
0
[nq:1]Hello all, I am doing a dissertation which requires me to use a wide corpus of English. I have looked ... use for concordancing and word frequency-of-use analysis? I would really appreciate any help or advice you guys could give me.[/nq]
There are masses of On Line Newspapers with Modern English, beware the Tabloids have different word frequencies from Broadsheets.

For pre 1923
0
[nq:1]There are masses of On Line Newspapers with Modern English, beware the Tabloids have different word frequencies from Broadsheets.[/nq]
Where does that leave The Independent, which now publishes in both formats?

Brian {Hamilton Kelly} (Email Removed) "We can no longer stand apart from Europe if we would. Yet we are untrained to mix with our neighbours, or even talk to them". Geor
0
[nq:2]There are masses of On Line Newspapers with Modern English, beware the Tabloids have different word frequencies from Broadsheets.[/nq]
[nq:1]Where does that leave The Independent, which now publishes in both formats?[/nq]
Betwixt and between?
Sitting on the fence?
0
Howie at

says in (Email Removed):
[nq:1]Hello all, I am doing a dissertation which requires me to use a wide corpus of English. I have looked ... words which I use for concordancing and word frequency-of-use analysis? I would really appreciate any help or advice you guys[/nq]
Dolls need not reply? That's my four-word contribution. Will it help?
[nq:1]could give me.[/nq]
0
writes
[nq:1]Hello all, I am doing a dissertation which requires me to use a wide corpus of English. I have looked ... use for concordancing and word frequency-of-use analysis? I would really appreciate any help or advice you guys could give me.[/nq]
See "Computational Analysis of Present Day American English", Henry Kucera and W Nelson Francis, Brown University Press, 1967. A little out o
0
[nq:1]See "Computational Analysis of Present Day American English", Henry Kucera and W Nelson Francis, Brown University Press, 1967. A little out of date perhaps, but it includes samples of AmerEnglish from a very wide variety of areas of interest.[/nq]
Hi Dave,
Thanks for that.
I'm afraid I really need to do this from a text-file based corpus. Having done some research on the Kucera e
0
[nq:1]Dolls need not reply? That's my four-word contribution. Will it help?[/nq]
Well, it didn't help me, but it seemed to make you feel better. That gives me a lovely warm feeling anyway.
H.
0
writes
[nq:2]See "Computational Analysis of Present Day American English", Henry Kucera ... AmerEnglish from a very wide variety of areas of interest.[/nq]
[nq:1]Hi Dave, Thanks for that. I'm afraid I really need to do this from a text-file based corpus. Having done some research on the Kucera et-al work, It doesn't seem to be available in that format. I'll keep looking...[/nq]
I know
0
writes
[nq:2]Hi Dave, Thanks for that. I'm afraid I really need ... seem to be available in that format. I'll keep looking...[/nq]
[nq:1]I know it was done nearly 40 years ago, but it was done on a computer, so there might just ... the authors still has the original files it was compiled from. If, of course, you can find the authors $-}[/nq]
Hi again Dave,
Yes - it might be a possi

Related Questions