Refine this word faster
Corpus
"Corpus" in a Sentence (19 examples)
We created a freely available English-Japanese bilingual corpus.
I would prefer to have a list of Italian words which aren't in the corpus.
Were we to populate the corpus with unnatural sentences or inaccurate translations, this resource wouldn't be of much use, now would it?
It's so easy to write good example sentences, that even if we accidentally delete a few good sentences in the process of getting rid of a whole lot of bad ones, I think we could drastically improve the quality of this corpus by doing a lot of deleting.
Tom and Mary were on the verge of diving, off the left edge of the sentence, in the infinite corpus, when they spotted underneath a shoal of hungry contributors, teeth out, ready to jump on them and shred their mistakes down to the last one.
At the moment, normal users cannot delete sentences, only corpus maintainers can. We will someday add the possibility for users to delete their own sentences, but in the meantime, if you want to have a sentence deleted, add a comment on the sentence asking for deletion and explain why you'd like to delete it.
Some people say the corpus is intended for learners, some think it's more for scientific purposes.
I wish there were more Native American languages in the Tatoeba Corpus.
One way to lower the number of errors in the Tatoeba Corpus would be to encourage people to only translate into their native languages.
One way to lower the number of errors in the Tatoeba Corpus would be to encourage people to only translate into their native languages instead of vice versa.
Show 9 more sentences
No one suggests that Browning intended to mean vagina when he wrote “owls and bats, / Cowls and twats,” because the context does not allow for it, nor does the greater context of the Browning corpus.
A corpus approach is a useful methodology for observing, describing and interpreting the stylistic features of language in literary and non-literary texts.
Today, computer databases and corpora infinitely increase the ease of this type of research, but the collecting process remains essentially the same.
Text corpora are being used in most current lexicographic projects. Applied linguistic research is another field where text corpora are welcome as an inexhaustible source of empirical information, a polygon for testing various linguistic tools – spell-checkers, OCRs, machine translation systems, NLP systems, etc.
Comparable corpora are made up of texts in different languages that may be related in various ways, but are not translations of each other. They may have nothing in common at all, or be on the same subject, of the same genre, or from the same chronological period, etc.
The Lancaster/IBM Spoken English Corpus began in September 1984 as part of a research project into the automatic assignment of intonation […] The original design of the corpus was determined by the need to provide data for research into speech synthesis. As a result, unlike most other corpora currently being used in the computational linguistics field, the SEC exists in several forms. […] However, whatever the original motivation for compiling a corpus, it quickly becomes an object of interest in its own right. New users find it valuable for applications for which it was not designed.
the corpus of the uterus
About a hundred years ago in Germany, the publishing of corpuses of the ancient Greek coinages was started. […] The significance of those, and some other corpuses is exclusive, because they allowed an enormous amount of numismatic material kept in museum and private collections all over the world, to be studied and systematized.
An assessment in 1991 proposed publication of the results of this work in three stages: […] secondly, a corpus of the Roman pottery to present the type series and to discuss the fabrics and forms recovered, […]
See also for "corpus"
Next best steps
Mini challenge
Unscramble this word: corpus