|Disciplines||Linguistics, Literature and Language - Irish|
|Temporal Terms||Modern (19th c. to 20th c.), 21st century|
|Methods and Techniques||Audio/Video interaction and sharing, Automatic recogition, Collaborative publishing, Communication and collaboration, Data Analysis, Data Capture, Data publishing and dissemination, Data reuse, Data Structuring and enhancement, Digital document preparation, Generic Searching/linking/visualizing, Graphical interaction and sharing, Linking records, Manual transcription, Practice-led Research, Project Management, Requirements, Resource sharing, Searching and querying, Security/backup, Statistical analysis, Strategy and project management, Text Encoding, Textual analysis, Textual interaction and sharing, User interface/Website design, Visualization, Web technologies|
|Contact||kscannegmail [dot] com|
|Start/End date||January 2005 - (open-ended)|
|Data Formats||Adobe Portable Document Format (PDF), Extensible Markup Language (XML), Text File (TXT)|
|Irish Geographic Names||All Ireland|
Semantic networks are sometimes called a "wordnets", after Princeton's English-language WordNet, which was the first-ever full-scale semantic network, dating back to the mid-1980's. Semantic networks are similar to, but much richer than, traditional thesauri, which generally only record (near) synonyms and sometimes antonyms. Líonra Séimeantach na Gaeilge, like most other wordnets, encodes a richer set of relationships, including hypernyms and hyponyms (broader and narrower terms), meronyms and holonyms (part vs. whole), etc.
Semantic networks have many applications in Natural Language Processing. They are used in systems for word sense disambiguation, document summarization and indexing, and information retrieval. When a semantic network in one language contains mappings to a second language (LSG is linked to the English WordNet), it can be used in various ways to improve machine translation. In general terms, from an artificial intelligence perspective, a semantic network encodes some of the "real-world knowledge" that is required for computers to understand and process texts in a non-trivial way.
* Comprehensive database. There are 32742 synsets, 36262 headwords and 77596 individual word senses, including a great deal of modern terminology, as well as literary and dialect forms, slang, etc.
* Free license. Like the Princeton WordNet (but, unfortunately, unlike nearly all other wordnets in existence), the LSG is free software. Specifically, all data, including the PDF thesaurus, are released under the terms of the GNU Free Documentation License. This means, in short, that you have the freedom to copy and redistribute the data, with or without modification, as long as you do so under the same license.
* English mappings. Entries in the LSG are linked to synsets in the Princeton WordNet.
* Frequent updates. We are providing regular updates, incorporating corrections and refinements, but also reflecting Irish as a living language via new terminology, shifting usages, etc.
* Common lexicon. The database used to generate the thesaurus is the same one used to generate the GaelSpell family of spellcheckers and the Gramadóir grammar checker. Improvements to one project will be reflected in the others automatically.