User:Divinenephron

From Wiktionary, the free dictionary

I'm a student studying medicine at the University of Cambridge. I mostly focus on adding etymologies linked to medical science – I use them to help me remember parts of my course.

I also use other resources such as the OED and Wordnik. When I have time I hope to create a bot to import etymologies from the public-domain Century Dictionary, as Wordnik has done to great effect.

My Tools


My Todo List

  1. Create Century Bot.
    • Find an OCRed version of the Century Dictionary or OCR it myself.
    • Extract etymologies from the text using the Python NLTK (Natural Language Toolkit), probably running on the PWF or SRCF servers.
    • Ask some admins where they'd like the data to be added – namespace and layout (I'd like it to be extensible, so that other information and dictionaries can go in the same place).
    • Create an application (GAE?) to compare current Wiktionary pages with their Century Dictionary entries side by side, so that editors can quickly transfer useful information.
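
A rough sketch of the etymology-extraction step, using only the standard library `re` module rather than NLTK. It assumes each OCRed entry sits on one line and that the etymology is the first square-bracketed span after the headword; the real OCR output may well be laid out differently. The function name `extract_etymologies` and the `SAMPLE` text are made up for illustration and are not taken from the Century Dictionary.

```python
import re

# Assumed layout (hypothetical): "headword, pos. [etymology] definition ..."
# The headword is the first run of characters before a space, comma or bracket.
ENTRY_RE = re.compile(
    r"^(?P<headword>[^\s,\[]+)[^\[\n]*\[(?P<etymology>[^\]]+)\]"
)


def extract_etymologies(ocr_text):
    """Return {headword: etymology} for lines matching the assumed layout."""
    result = {}
    for line in ocr_text.splitlines():
        match = ENTRY_RE.match(line.strip())
        if match:
            # Collapse runs of whitespace, which OCR output often contains.
            etymology = " ".join(match.group("etymology").split())
            result[match.group("headword").lower()] = etymology
    return result


# Invented sample entries, only to exercise the function.
SAMPLE = """\
abdomen, n. [L. abdomen, belly.] The belly.
femur, n. [L.  femur, thigh.] The thigh bone.
A line with no bracketed etymology.
"""

if __name__ == "__main__":
    for headword, etymology in extract_etymologies(SAMPLE).items():
        print(headword, "->", etymology)
```

Lines without a bracketed span are skipped, so the output would still need checking by hand before anything is added to Wiktionary.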