User:Divinenephron
I'm a student studying medicine at the University of Cambridge. I mostly focus on adding etymologies linked to medical science – I use them to help me remember parts of my course.
I also use other resources such as the OED and Wordnik. When I have time I hope to create a bot to import etymologies from the public-domain Century Dictionary, which Wordnik has used to great effect.
My Tools
My Todo List
- Create Century Bot.
- Find an OCRed version of the Century Dictionary or OCR it myself.
- Process the text for etymologies using the Python NLTK, probably on the PWF or SRCF servers.
- Ask some admins where they'd like the data to be added – namespace and layout (I'd like it to be extensible, so that other information and dictionaries can go in the same place).
- Create an application (GAE?) to compare current Wiktionary pages with their Century Dictionary entries, so that editors can quickly transfer useful information.
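
A rough sketch of the etymology-processing step from the list above. It assumes each OCRed entry arrives as a single string and that the etymology sits in the first pair of square brackets, which is how I expect the Century Dictionary to lay them out. The function name `extract_etymology` and the sample entry are made up for illustration, and this version uses only the standard-library `re` module rather than NLTK.

```python
import re

# Assumption: the etymology is the first [...] span in an entry.
# Real OCR output will probably need more cleanup than this.
ETYMOLOGY_RE = re.compile(r"\[([^\[\]]+)\]")


def extract_etymology(entry_text):
    """Return the text inside the first [...] span, or None if there is none."""
    match = ETYMOLOGY_RE.search(entry_text)
    if match is None:
        return None
    # Collapse whitespace left over from line breaks in the scanned page.
    return " ".join(match.group(1).split())


# Hypothetical sample entry, invented for illustration only.
sample = (
    "cardiac (kar'di-ak), a. [< L. cardiacus,\n"
    " < Gr. kardiakos, < kardia, heart.] Pertaining to the heart."
)

print(extract_etymology(sample))
```

Entries with no bracketed span return `None`, so the bot could skip them or flag them for manual review.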