IIIT-H researcher makes largest version of Telugu dictionary

Thus far, she has published five papers, including two remotely.

HYDERABAD: Starting out with a mind-numbing task of typing out 21,000 odd Telugu words along with their meanings from an authentic dictionary, Sreekavitha Parupalli, a researcher from the Centre for Exact Humanities IIIT Hyderabad has built a database of Telugu words in their usage as verbs, adverbs and adjectives.

It also happens to be the largest computational version of Telugu dictionary. Sreekavitha has contributed 8483 verbs, 253 adverbs and 1,673 adjectives to OntoSenseNet - a verb-centric ontological resource for indian languages.  “With the Telugu dictionary, native speakers can now perform better annotation tasks as both the word and its meaning are in a language they are familiar with. And any computer science expert or researcher can design algorithms that can work on the annotated language data. My research work has added numerous words to the existing resources such as the Telugu WordNet”, she said.

While researchers working in the areas of language processing, machine translation, word sense disambiguation, culture studies, language teaching, dictionary compilation, and other areas of linguistics and language technology have plenty of online English language resources, for vernacular languages this is a challenge. This coupled with encouragement from her late Prof Navajyoti Singh paved way for Sreekavitha’s research.

Thus far, she has published five papers, including two remotely. “All of this is new and pioneering work. And the multiple papers I published are basically experiments that adopt the dictionary developed and the ontology proposed by Prof Navjyoti,” said Sreekavitha. The dictionary has also improved the accuracy of sentiment analysis task in Telugu.

Related Stories

No stories found.

X
The New Indian Express
www.newindianexpress.com