---
Computers - Artificial Intelligence - Natural Language
---

Click here!

Natural Language Technology in Mumbai - Course at India's National Centre for Software Technology. Official site with archived lecture, schedule, assignments.

Survey of the State of the Art in Human Language Technology - A 1996 high-level review of: spoken/written input, analysis and understanding, generation, speech output, discourse and dialogue, document processing, multiple languages and modes, transmission and storage, mathematical methods, other resources, how to evalate an NLP program.

Language Technology Group Helpdesk FAQ - Questions and answers on a wide range of language processing topics, with keyword index.

SULTRY: The Sydney University Language Technology Research Laboratory - An institute which applies natural language processing research to problems of human-computer interaction. Selected publications, academic information, and descriptions of current projects.

Bibliography of Research in Natural Language Generation - A part of the Computer Science Bibliography Collection, providing references for papers published through 1994. Searchable, browsable.

ELDA: The European Language Resource Distribution Agency - The practical arm of the ELRA agency, dedicated to solving practical and legal problems in the distribution of language resources. Legal information, catalog of resources for sale, current projects.

"String Searching Algorithms" Book - String Searching Algorithms: exact / approximate string matching, edit distances, common sequences, longest repetitions.

References on Zipf's Law - An academic bibliography on this relation between a word's frequency in a text and its place in a ranking of words by frequency. Includes some online texts.

Tabularum - A resource site for research into the automatic understanding of tabular information, including information extraction from tables.

The Natural Language Software Registry - A "concise summary of the capabilities and sources of language processing software available to researchers. It comprises academic, commercial and proprietary software with theory, specifications and terms on which it can be acquired clearly indicated."

Language Technologies Institute - A research program at Carnegie Mellon University, focusing on machine translation and speech processing. Includes news, admissions procedures, staff profiles and current projects.

Teaching materials for statistical NLP - Centered around Eugene Charniak's book "Statistical Language Learning", this page features a book review, some notes, sample solutions to exercises in the book, and a small corpus with which to experiment.

Grammatical Inference - Repository of information on grammatical inference, automata induction, and language acquisition.

European Language Resource Association - A nonprofit organization serving the commercial language resource community. Site features quarterly newsletter, official definition of "language resource," and member services.

Mingsee, Inc. - Develops systems which enable computers to analyse and "understand" text by using proprietary algorithms.

Language Technology Projects - Descriptions of projects covering a wide range of language technology applications. Includes a slide presentation and summary chart for each project.

EAGLES: Expert Advisory Group on Language Engineering Standards - A European Commission initiative to provide standards for linguistic engineering applications such as corpora, lexicons, mark-up languages and software. Contains current guidelines.

ACL NLP/CL Universe - A searchable directory of sites on natural language processing and computational linguistics.

Language Technology World - A comprehensive portal on the wide range of technologies that deal with human language. News, conferences, projects, organisations, systems, and resources.

DISC Best Practice Guide - Information, standards and tools to support the development of Spoken Language Dialogue Systems.

Fieldmethods.net - News pieces and discussion fora for various fields of natural language technology.

Natural Language FAQs - Selected FAQ lists from Usenet groups related to natural language processing.

INFOLINGUA : Natural Language Processing - References in computational linguistics, morphology, parsing, lexicography, text understanding, text generation, interfaces, automatic translation, CALL, speech processing, quantitative linguistics, automatic indexing, character recognition, literary computing, dictionaries.

CMU AI Repository - NLP area - Machine readable parts of NLP textbooks, NLP corpora and dictionaries, fonts, and software.

Link Grammar - A formalism for the computational parsing of English. Includes parser with downloadable source code, English-to-German translator, documentation, bibliography.

Lexical Functional Grammar - A collection of information on this formalism. Basic information, archive, bibliography, LFG links, e-mail list.

Lexical Functional Grammar: The Stanford Web Site. - A collection of information about this theory of grammar. Archive, downloadable bibliography, conference information, LFG implementations.

CLASS: Collaboration in Language and Speech Language and Technology - European Union program promoting cooperative development of language technology. Administrative information, associated projects, publications in DOC format.

VerbMobil - Mobile translation system for the translation of spontaneous speech in face-to-face situations.

Connexor Parsers - Language parsers and taggers for English, German, French, Spanish, Italian, Dutch, Finnish and Swedish. On-line parser demos and limited documentation available.

Berkeley Formal Grammar Conference 2000 - A meeting of grammar engineers, with separate sessions on lexical-functional grammars, head-driven phrase structure grammars, and constraint-based grammars as a whole. Schedule, general information, workshop proposals.

MUC-6 - The 1996 Message Understanding Conference, organized for the evaluation of information extraction systems applied to a common task. Task definitions, templates, scenarios.

LFG98 Lexical Functional Grammar Conference - Held at the University of Queensland, Brisbane, Australia. Workshops, organizational information, downloadable papers on Austronesian languages.

LFG2001 - Held at the University of Hong Kong. Organizational information, program, workshops, abstracts.

LFG2002 - To be held at the National Technical University of Athens, Athens, Greece. Call for papers, information on accommodations and transportation.

International Workshop on Information Presentation and Natural Multimodal Dialogue - Meeting exploring the connection between interface design and language technology, held in Verona, Italy. Background, program, papers online in PDF format.

Chompy Home Page - Chompy is a freeware natural language parser written in Java. There are two main aims to the Chompy project: to provide an educational program to demonstrate the basics of Chomsky's structuralist grammar, and to develop a natural language parser class for Java.

Virtual Woman by CyberPunk Software - Beta test the shareware Virtual Woman game. Color graphics and music.

Wired News: Meet HAL's Ancestors - With recent advances in voice-recognition, it's becoming easier than ever to talk to a computer. But how well does the computer understand what it's being told?

SabrinaDev : Opensource AI - An opensource AI project to create a Turing-Test-capable chatbot for self and social improvement.

The Billy Project - Chat bot Billy, available for download.

MeBot Project - A 'welcome bot' experiment set up in the first person to represent the author, MrHolden.

Arty Fishal - Web-based bot attempting to simulate an insane asylum resident.

Jesus Chat - Chatterbot called Jesus.

Zabaware - Chatterbot software gives computers a personality. UltraHal is available as a PC assistant (Windows shareware), a web-based demonstration conversation application and an AI bot for websites.

ECTOR, a Future Intelligent Chatterbot - A description of how to build an intelligent chatterbot (based on an AI PhD thesis and an IRC chatterbot).

ICQza - Natural language artificial intelligence chat robot for ICQ. ICQza sends and receives ICQ instant messages and simulates human responses.

Jack the Ripper Bot - Peer into the mind of the serial killer.

The Simon Laven Page - An entertaining site for chatterbot enthusiasts and professionals. With dozens of chatterbots, two Java chatrooms (with and without chatterbots), a chatterbot message board, chatterbot papers, and the latest chatterbot news through its free newswire service.

Paolo's AI - PAULA - The home of downloadable, teachable chatbot Paula.

Eliza - Web-based version of this person-centered therapist emulator.

GuruBot Chatterbot - Learns from user input, has artificial intelligence and uses brain files to store information. Spanish brain available.

Catty bot - Does not know about any language, does not learn, adapt or process natural language - thanks to www.google.com

Eliza - Sarcastic Version - Web-based sarcastic version of this person-centered therapist emulator.

Maybot: Talking Your Language - Maybot is a recently-formed business supplying Internet-based chatterbot software and services, and other natural language interface applications.

Talking Servant - Talking Servants German Language Chatterbot Anette.

Colorful Personalities - Dialogues with colorful personalities of early AI.

HuMimics, Inc. - Home of a new age in artificial human technology, applied mimetic sciences. Talk to Web One and other online mimetic entities.

Colorzone Interactive Media - Colorzone - home of CARA the talking Chatterbot.

BOTizen - Company providing automated online personalities for customer service purposes. Corporate information, news, product and service details, success stories.

Gnod: The Global Network Of Dreams - A self-adapting system which searches the web on the basis of user input. "A search-engine for things you don't know about."

The Personality Forge - Chat with a community of Artificial Intelligence Personalities, then create your own AI Personalities and watch them chat with real people. Bots will remember you.

Elizaneth - A Javascript implementation of Joseph Weizenbaum's Eliza by Arne Solli.

Dr. Werner Wilhelm Webowitz: Web Quack Psychiatwist - A web-based version of Eliza. Meant to imitate a conversation with a psychiatrist.

BotSpot: The Mutual Admiration Society - Article by Don Barker about what happens when chatterbots like Eliza, ALICE and Shallow Red "converse" with each other.

The Forbin Project - Named after a 1969 movie, this site features logs of conversations between the chat bots ALICE and Barry DeFacto.

Arthur the Chatterbot - Developed by the Kingston school of Information Systems.

Yu - Learn the Secrets - Yu is an overworked programmer spending most of his time chatting at the side. If you manage to impress him he will let you into Theories.com

N.I.C.O.L.E. - "Nearly Intelligent Computer Operated Language Examiner" - NICOLE is a theory or experiment that if a computer is given enough combinations of how words, phrases and sentences are related to one another, it could talk back to you.

Android - Downloadable software that refers to a user-extensible digital knowledgebase.

Answerpad - Downloadable program which answers questions based on past memorized conversations. Compatible with Microsoft Agent characters.

The Disputed Illegitimate Son of Mgonz - Recreation of Mark Humphrys' original Mgonz, By Dave O Connor. Rude, profane, but fun.

Peer-Reviewed.Org - The web site of a ficticious organization that is populated by chatterbots that discuss the peer-review process.

Chatterbots: Crash Test Dummies of Communication - This is a research about chatterbots and CMC.

Applications of AI: Eliza - Description of Eliza.

Ai Research - Focuses on creating genuine Artificial Intelligence - the technology that enables machines to converse with humans in natural language. Based on a groundbreaking approach, Ai's technology will pass the Turing Test for machine intelligence by 2011.

Ask ALEX - Experimental bot programmed to help users find legal information on JURIST: The Legal Education Network, and elsewhere online.

a virtual French conversation - talking French with Camif, a virtual chatbot with educational purpose : make students speak correctly French

The Internet Firstborn - About an artificial baby with the ability to chat and learn, like a real person.

a virtual English conversation - talking English with Camif, a virtual chatbot with educational purpose : make students speak correctly English

Classic Chatterbots: Eliza by Joseph Weizenbaum - Describes Eliza and offers downloads.

Joseph Weizenbaum's ELIZA: Communications of the ACM, January 1966 - A Computer Program For the Study of Natural Language Communication Between Man and Machine.

Eliza - The computer psychiatrist. - Web version of Eliza.

Deepti Project - Aims to develop a Hindi-speaking chatbot using AIML along the lines of the ALICE project. News, documentation, team member information.

Functional Response Emulation Devices - A survey of the android studies conducted by Robby Garner, 2-time winner of the Loebner Prize.

Eliza's Brain - A site with tools and resources for creating and operating your own chatbot. The interactive bot Ellie is online.

jabberWACKY.com - live AI Chatbot - Have a chat with the Jabberwack today - a fully conversant, amusing bot called 'Thoughts'. Then try out some nonsense sentences from your own text with the 'Words'. Jabberwacky is a chatbot - an Artificial Intelligence - an AI.

Android Studies - Robby Garner's chatterbot page, home of the Turing Hub, a 24/7 Turing test.

2002 Loebner Prize Contest Home Page - Call for Entries, Rules, and other Information about the 2002 Loebner Prize Contest in Artificial Intelligence, to be held in Atlanta Georgia, USA.

Chat bot in Bulgarian - A bulgarian chat bot.

Virtual Personalities - Verbots help real humans deal more comfortably and effectively with their increasingly complex world.

Mind Files - Extensive collection of mind files and other resources for the Daisy and Billy chatbots, provided by Greg Leedberg.

Elizabeth - A User-Friendly Chatterbot for Windows - Elizabeth is a friendly, powerful and adaptable ELIZA-style system for conversation and AI/NLP learning, with visual displays of all internal processing, and many additional facilities (e.g. for handling grammars).

RunABot - create a personal chat bot with an easy to use Web-based interface.

1001 Questions - Teach this program from MIT AI Lab about everyday objects. Its questions improve with experience, and collected knowledge will be used to make other programs smart.

The Programmable Artificial Intelligence project - PAI (Programmable Artificial Intelligence) is a program capable of having a conversation in its mother tongue, English.

Artificial Intelligent Robots - The home of the chatterbot Talk-Bot, which uses smiley faces to express emotions and small icons to enhance the conversation.

The Chatterbox Challenge - An annual Chatterbot Contest offering money and awards to contestants in several different categories.

ALIMbot - The knowledgeable bot - ALIMbot is a hobby project written in Visual Basic 6.0 by M F Wahid. ALIM, in Classical Arabic, means the one who has knowledge. ALIMbot is an Artificially Intelligent chatbot that tries to effectively react to the user input. It will not only decide What to say? but also How to say?.

BotSpot Chatterbots - Links to chatterbots and provides reviews and information about them.

Chatbot Friends - This website is for chatterbot fans everywhere.In this site you will find links to chatterbots, software, links to sources of funding (under contests) andchatbots hosted on this site.

MobyGames - Racter - Short information about the software and pictures of the cover.

Racter FAQ - An expose of the Racter speech engine.

Urban Shoes: Hello, I´m Racter - Article by Luis Joaquin M. Katigbak where he talks with Racter.

Racter - Introduction and some links.

Claude - A shareware clone of Racter.

The ALICE Connection - Dedicated to the ALICE chat bot project. Downloadable version, links to documentation and background information.

A.L.I.C.E. AI Foundation - This page contains information and links on ALICE, some papers, and news articles.

Ally the Chatbot - Once upon a time there was beautiful girl bot. She saved the world and everyone loved her. Her name was Ally. The End.

The alicebot Project - This site revolves around the ALICE chatterbot.

John Lennon Artificial Intelligence Project - Chat online with cyber-Beatle John Lennon. Java powered.

Alison's Chat Corner - Provides conversation with a computer-generated brunette woman.

Pascalice - An AIML interpreter supplemented with some editing tools. Binary file, source code, documentation, relevant links.

Turing Test Page - Comprehensive list of online resources pertaining to all aspects of the Turing test, including background reading and chatbot links.

The Turing Test and Chinese Room Experiment - A nice, concise page by Larry Hauser describing the Turing Test and Searle's Chinese Room.

Computing Machinery and Intelligence - Turing's original 1950 article on machine intelligence, where he introduces the famous Turing Test, and started this profound multi-decade debate.

Loebner Prize Home Page - Annual Turing Test contest.

Improving the Turing Test - An explanation of how one might enhance the Turing Test format into Virtual Reality.

Article in ACM Crossroads Magazine - Published in Crossroads, the ACM student magazine, this is an article arguing that the Turing Test is not a good test for intelligence.

Turing Test Questions - An attempt to collect the largest possible number of one-line questions that could be asked during a Turing Test.

Turing test and intelligence - This document examines the meaning of the Turing test and suggests that meeting the turing test is already in the process of being achieved.

(Spain) UNED NLP Group, Madrid - Natural Language Group at the Spanish National Distance University (UNED).

(USA) Columbia Natural Language Processing Group - Pursues research in natural language generation, concept-to-speech generation, summarization of news, statistical language modeling and digital libraries. Info on their projects, people, publications, software tools, events.

(USA) Computing Research Laboratory - Concentrates on multilingual processing of natural language texts. Core research areas are: AI, computational linguistics, and human-computer interaction. Has papers, data, and software.

(UK) Word-Grammar Interest Group - A research group which develops and applies the theory of Word Grammar. Site lists members and some relevant publications.

(USA) Microsoft NLP Research - Information on their projects, people, publications, and employment opportunities.

(UK) Language Evolution and Computation homepage - A University of Edinburgh research unit. "Our research involves applying mathematical and computational modelling techniques to traditional issues in the evolution of communication and language, historical linguistics, and language typology." Site lists group members, online papers, software, and related links.

(UK) Corpus Research at the University of Birmingham - Information on their projects, software tools, links to corpus information.

(Austria) Austrian Research Institute for Artificial Intelligence - "Research in modelling and processing human languages, especially for German. This includes constructing linguistic resources (such as lexicons, grammars, discourse models), processing algorithms (such as morphological components, parsers, generators, speech synthesizers, discourse processing components), and application prototypes (such as natural language interfaces, advisory systems and concept-to-speech systems)."

(UK) University of Cambridge NLP Group - The history of this group; Project links; links to papers and technical reports.

(USA) Natural Language Processing Group, MITRE - MITRE's research efforts in the area of natural language processing. Links to projects, people, papers, and software.

(UK) University of Leeds - Pointers to projects and software tools and information about the researchers.

(USA) Johns Hopkins University NLP lab - "Committed to finding novel and efficient computational methods that rival human performance in natural language competency tasks." Information on their people, conferences and meetings, links, courses, facilities, software tools.

(UK) Edinburgh Language Technology Group - A research and development group working in the area of natural language engineering. The site contains several software tools free to academic research groups.

(Germany) DFKI Language Technology - This research lab of the German DFKI research institute has several projects on language technology.

(UK) University of Sussex at Brighton - People, research topics, technical reports and dissertations, info for prospectives students, research seminar series, links.

(Canada) Simon Fraser University Natural Language Laboratory - "Computers are used to understand the structure and meaning of "natural languages" such as English, French, and Spanish." Machine translation, computer-assisted language learning, information extraction, natural language interfaces. Publications online.

(USA) SRI AI Center NLP Program - Information on their projects in multimedia/multimodal interfaces, spoken language systems, written language systems. Links to the projects, publications, staff.

(Belgium) Centre for Computational Linguistics - The main objective of the Centre for Computational Linguistics at the Katholieke Universiteit Leuven is to promote basic research in formal and computational linguistics, and the application of this research in natural language processing.

(UK) University of Sheffield NLP Group - "Architectures for NLP, NL Analysis (IE and Dialogue), NL Generation and NLP Resources and Tools." The developers of GATE.

(USA) Language Science Research Group, Washington University - Research in this group focuses on segmentation and language acquisition.

(USA) MIT Infolab - Research group of the MIT AI Laboratory. A key on-line system is START, Natural Language question-answering over several topics.

(Australia) Centre for Language Technology, Macquarie University - Research on Language Technology, with particular emphasis in practical applications in the short and medium term. Links to research projects and university courses.

(USA) Information Sciences Institute - NLG Group - The Natural Language Processing group at the Information Sciences Institute of the University of Southern California (USC/ISI) is currently involved in various aspects of computational linguistics/natural language processing.

(Spain) Universitat Politecnica de Catalunya's Natural Language Processing Research Group - Main research fields are related to the use of multilingual lexical resources, information extraction from documents, design of NL interfaces, basic NLP techniques (tagging, parsing, sense disambiguation), NL understanding and Knowledge Representation. Tools and demos available.

(Sweden) Human Language Technology group at NADA - Performs research within all aspects of human language and computers. Links to courses, projects, publications, and reports. Some of the contents are in Swedish.

(USA) Neural Theory of Language (NTL) Research Group - A group of scholars at the University of California, Berkeley, studying the connections between neurology, computing and language learning. Current projects, research articles, and an overview of the group's history and purpose.

(UK) Computational Linguistics UK - Britain's special interest group for computational linguistics. News, organizational information, and general information on the British natural language processing research community.

(USA) CSLI Center for the Study of Information and Technology (CSLI) - An independent research center devoted to research in the emerging science of information, computing, and cognition. Founded by researchers from Stanford University, SRI International, and Xerox PARC.

(USA) Conversational Interaction and Spoken Dialogue Research Group - A University of Rochester research group that investigates conversational interaction through the study of machine-human interaction. Program information, current projects, tools for corpus linguistics and discourse transcription, archive of downloadable papers.

(UK) Laboratory for Natural Language Engineering - University of Durham - Research on several topics, including the LOLITA system. Links to list of projects, people, publications, Journal of Natural Language Engineering.

(USA) Center for Machine Translation - A Carnegie Mellon University research center that focuses on multi-lingual machine translation. Links to projects, personnel, job openings, and technical reports.

(USA) Computational Psycholinguistics Research at CLIP - Descriptions of current projects and links to published papers. Covers the areas of syntactic disambiguation, selectional constraints and semantic similarity.

(Netherlands) Language and Inference Technology group - This group is part of the Institute for Logic, Language and Computation (ILLC) at the University of Amsterdam. Research on representational and algorithmic aspects of computational linguistics and computational logic. Links to news, people, publications, research, and teaching.

(Italy) Cognitive and Communication Technologies Division at ITC-IRST - A research institute of the Instituto Trentino di Cultura focusing on NL generation, information extraction, dialogue and multimodality, linguistic resources and tools, parsing, and formal linguistics.

(USA) Xerox Content Analysis - A team working on basic products for multilingual language analysis, providing current projects, demos, and an archive of publications. Includes an online demo guessing 47 languages.

(Mexico) Natural Language Lab of the National Politechnic Institute - The homepage of the head of this lab. Links to nearly all products of the Lab. Areas of interest are computational syntax, semantics, anaphora resolution, lexical resources. The Lab organizes an annual international conf, see www.cicling.org.

(France) Language, Information and Representation -- LIMSI - Research on knowledge and reasoning, document processing, interpretation, generation and dialogue processing, and question/answering. Links to members, topics, reports. Versions in English and French.

(Greece) NCSR "Demokritos", Software & Knowledge Engineering Laboratory - NCSR "DEMOKRITOS" is the biggest state-run research centre in Greece. The Software & Knowledge Engineering Lab (SKEL) at the Institute of Informatics and Telecommunications of NCSR develops technologies that address the emerging problem of information overload exploiting techniques and tools from the areas of Language technology, Personalization, Knowledge discovery in data, Multimedia processing.

(US) Responsive Virtual Human Technology - An NSF funded project led by Research Triangle Institute studying spoken language interaction with virtual characters

(Australia) Sydney Language Technology Research Group - A University of Sydney research group. Research on machine learning, XML/SGML markup, and tagging.Projects,resources,applications.

(USA) Human Language Technology Research Institute - A University of Texas research group. Research in NLP and speech recognition and synthesis.Links to people, projects, publications.

(India) Language Technology Research at IITK - A research group from the Indian Institute of Technology. Varied research including machine translation and processing of Hindi.

(USA) Cornell Natural Language Processing Group - Information about people, projects, publications, datasets, and courses.

(USA) Natural Language Theory and Technology group at PARC - Provides information about the people and projects in the NLTT group at PARC. Includes a short history and a list of selected papers.

(India) Language Technology Research at AU-KBC, Chennai - The group focusses on developing tools, technologies and products for Indian languages especially for Tamil. Research projects include Machine Translation, Information Retrieval(IR), Information Extraction(IE) and developing tools and lexical resources including a Tamil WordNet.

(Australia) CSIRO Intelligent Interactive Technology - "The Intelligent Interactive Technology (IIT) team at the CSIRO conducts basic and applied research in Natural Language Processing (NLP) and Multi-Agent Architectures to improve human-computer interaction."

ARIES Natural Language Tools - Proprietary tools for the lexical work on the Spanish language. Free demo, documentation.

Natural Language Software Registry - A directory of academic, commercial and proprietary software with specifications and licensing terms. From DFKI Saarbrücken.

Phrasys Natural Language Processing - A variety of services for natural language technology developers, including consultancy, commercial software and freeware. Features a Java component useful for string analysis.

GATE: General Architecture for Text Engineering - A computer architecture for a broad range of Natural Language Processing tasks, available under the GNU Public License. Abundant documentation, Java class library, web-based demos.

AFGL Project: Affix Grammars over a Finite Lattice - A system of public domain software for natural language processing. Includes a formalism for compact grammar description, parser generation system, transduction tool.

TextAI: Text Analysis International - Provides NLP applications based on its proprietary VisualText technology. Product and service information, online software tour, some documentation.

Cogilex - Company offering expert services and customized tools for natural language processing. Site features demo download of the "QuickTag and QuickParse" utility for Windows, also online tools.

DGA: Dependency Grammar Annotator - A Java-based interactive graphical tool for the syntactic annotation of texts within the formal framework of Dependency Grammars. Demo, download and documentation.

LT TTT - Text tokenization system and toolset, including transducer and separate part-of-speech tagger. License information, sample output, documentation. From the Edinburgh Language Technology Group.

Thistle - A Java GUI editor for editing tree diagrams (such as those employed in constraint-based grammars), existing in both applet and standalone forms. Sample trees and editors.

Public Domain Language Engineering Generic Tools - Lecture by Tomaz Erjavec, including text, slides and links. From the 1996 TELRI conference.

Alembic Workbench - Corpus analysis toolset from the MITRE Group, including SGML annotation tool. Download, documentation, research.

Annotate - Tool for semi-automatic graphic annotation of corpora. License, documentation, screenshot. Requires GCC and MySQL, in addition to registration.

Morphological and Orthographic Tools for English - UNIX tools for the analysis and synthesis of text, from Sussex's John Carroll. GZIP downloads, descriptions, related publications.

KPML Access Page - Graphically based language engineering program, developed for working with large-scale grammars under the Systemic Formal Linguistics framework. Downloadable program images, documentation, resources and source code.

Parser Servlet - Online demo of a top-down Java parser based on Linguistic String Project techniques. Parsing, XML annotation, English-to-Finnish machine translation.

Konstantz LFG Workbench - An implemenation of the 1982 LFG formalism designed for exploring the interface of syntax and morphology. Links to documentation, source files and compiled modules.

Grammar Writer's Workbench for Lexical Functional Grammar - An implementation of the 1982 LFG formalism, supplemented with more recent features. Downloads, documentation, bibliography, license information. From Xerox NLTT.

GroupLens - An experimental collaborative filtering service based on "Better Bit Bureaus" which is itself a collaborative venture between Paul Resnik of the Center for Coordination Science at MIT and Brad Miller and others at the University of Minnesota

Senga: Information Retrieval Software - Senga is a development group focused on information retrieval software. The primary purpose of the components distributed on Senga is to build a large scale internet search engine.

Smart Tutorial - A tutorial on the SMART IR system from Cornell. Put together by Hans Paijmans with a technical report on the implementation of an earlier version of SMART.

Banter Technology - Develops technologies for natural language processing, semantic analysis and adaptive business process automation

Emdros - Open source text database engine, including query-language, for storage and retrieval of linguistic analyses of text. Documentation, download and project updates.

OpenNLP - Collaborative organization for open source projects related to natural language processing. Lists ongoing projects and documents proposed standard Java and XML APIs.

Web Interface to WordNet - Direct online access to version 1.6 of this database. Hosted by Oxford English Online.

WordNet - A Lexical Database for English - Nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept.

WordNet Perl Module - Perl OO interface to George Miller's WordNet database.

CommonLisp Interface to WordNet - Examples, documentation, downloads, links.

WordNet Python Module - Python interface (by Oliver Steele) to WordNet database created by George Miller.

JWNL (Java WordNet Library) - A Java API for accessing the WordNet relational dictionary. Free download, support and background information.

CLAWS Tagger - From the UCREL corpus annotation project. Documentation, free online trial, licensing information.

Memory-Based Tagger Demo - Online tagging of Dutch, Spanish, English, Slovene, Swedish and German text. Extensive list of related research.

TnT: Trigrams 'n Tags - Statistical part-of-speech tagger trainable on different languages and tagsets. Trained and trainable versions. Online demo, noncommercial licensing information, documentation.

TreeTagger - A language independent part-of-speech tagger, available in versions adapted to German and to English. Documentation, downloads, license information, demos.

General Data Tagger - Perl code in text format from Kristie Seymore. Brief notes.

AUTASYS - A Fully Automatic English Wordclass Analysis System - A Windows 95/98 compatible tagger developed by Alex Chengyu Fang. Tagsets, samples, documentation.

QTAG - Downloadable Java-powered tagger, from the University of Birmingham.

Monty Tagger: A Brill-Based Part-of-Speech Tagger - Portable tagger with implementations in Python and Java, based on Eric Brill's 1994 tagger. Background and license information, downloads.

Machinese Phase Tagger - Online demo of this constraint grammar-based tagger for English, French, Spanish, German, Dutch, Italian and Finnish,

EuroWordNet - A project to compile compatible wordnets for seven European languages. Documentation, project reports, and downloadable database samples.

British National Corpus - A balanced synchronic text corpus containing 100 million words with morphosyntactic annotation.

The Association for Computational Linguistics - International professional society dedicated to research throughout the field of natural language processing.

The Computation and Language E-Print Archive - A fully automated archive of papers in computational linguistics, natural language processing, speech processing etc.

Books on Computational Semantics - Two draft online textbooks in PostScript format, by Patrick Blackburn and Johan Bos. Also features an online Prolog tutorial.

Human Language Technology - An European internet clearinghouse for language engineering research.

The ACL NLP/CL Universe - A directory of sites related to computational linguistics.

Frequently Asked Questions About Computational Linguistics - Geared to people who are unfamiliar with the field.

Statistical Natural Language Processing - A web-based course in statistical natural language processing from Göteborg University, Sweden. Includes a basic reading course, set of student projects and inventory of useful resources.

Hermit Crab - A morphological parser and generator, developed for classical generative phonology and morphology. Download, documentation, background information and computational morphology research.

Introduction to Language Technology Research - Introductory information and a directory of resources in theoretical and applied computational linguistics.

A Language of Metaphors - A theory that suggests that metaphors are based in mathematical truths, and may be key to both brain structure and artificial intelligence.

ILK: Induction of Linguistic Knowledge - A research program at Tilburg University in the Netherlands, aimed at using inductive learning technology to advance both language engineering and the understanding of linguistic knowledge. Publications, downloadable software, and text analysis demos.

InDiGen: Integrated Discourse Generation - A research project which aims to bring about "an integrated approach to discourse and sentence planning which captures the interaction of discourse marker selection, ellipsis, and discourse structure." Includes related publications, web-based demo.

Definition of Computational Semiotics - A concise outline of this field from the perspective of defining knowledge units for artificial intelligence systems.

Andersen, Peter Bøgh - Professor at the University of Aalborg in Denmark, whose research focuses on computational semiotics. Papers and descriptions of current research.

SIGdial: Special Interest Group in Discourse and Dialogue - A subgroup of the Association for Computational Linguistics which supports empirical, standardized research in the computational analysis of spoken discourse, including standard corpora. Organizational information, events, and resources.

SIGPHON: Special Interest Group in Computational Phonology - A subgroup of the Association for Computational Linguistics which supports computer-based research in phonology and morphology. Organizational information, bibliography.

Jurafsky, Daniel - University of Colorado professor whose research includes machine learning, parsing and computational psycholinguistics. Current research, syllabi, and archive of publications in PostScript and PDF formats.

SIGSEM: Special Interest Group in Computational Semantics - A subdivision of the Association for Computational Linguistics. Includes organization information and schedules, as well as assorted research tools and resources.

SIGNLL: Special Interest Group on Natural Language Learning - A subgroup of the Association for Computational Linguistics, dedicated to research on the machine learning of natural language.

SIGGEN: Special Interest Group in Text Generation - A subgroup of the Association for Computation Linguistics, supporting research in computer generation of natural language. Organizational information and extensive natural language generation resources.

L2004 - Course module at UMIST providing a general introduction to the field, including parsing and sentence generation algorithms. In sequential HTML pages.

Computational Morphology and Phonology - A list of online resources related to computational morphology and phonology.

Morphological Parsing - Downloads and documentation for the PC-KIMMO morphological parser, as well as background information and research in computational morphology.

WordNet Bibliography - A comprehensive list of research publications involving the WordNet lexical database.

Introduction to Computational Phonology - Brief course on the fundamentals of this field, by Dafydd Gibbon. Includes basics of computing phonotactics and phonological parsing.

Global Wordnet Association - A society dedicated to the collection and standardization of wordnets, corpora, and other basic language processing tools. List of current and pending wordnets.

The Xtag Project - Aims to develop a wide-coverage grammar of the English language, using a lexicalized tree adjoining grammar formalism. The current version of Xtag, as well as general resources for tree adjoining grammars.

Computational Morphology - An introduction to the challenges which computational morphology poses for the programmer. By Harald Trost.

Learning Computational Grammars - A European research project which ran from 1998 to 2001, exploring the possibility of expanding computational grammars through machine learning. Publications, demos, project information.

Dan Jurafsky's Computational Psycholinguistics Research - Publications pursuing probabilistic models of psycholinguistic phenomena.

What is Computational Linguistics - A concise introduction to the field, by Hans Uszkoreit.

Corpus-Based Computational Linguistics Resources - An annotated list of resources in this field and the allied discipline of statistical natural language processing. Corpora, tools, literature and other resources.

Bonnema Renko: "Data Oriented Semantics" - A thesis project, presenting many of the issues facing computational semantics and some experimental solutions.

International Committee on Computational Linguistics - Organizes the worldwide COLING conference. Information on the nature of COLING, past COLING proceedings, and hosting future COLINGs.

Rieger, Burghard Publications - A list of papers in computational semiotics and computational semantics. Many are downloadable in PDF format.

Nordic Computational Linguistics Network - Serves the computational linguistics research communities in the Nordic and Baltic languages. Current activity in the field, past proceedings of the Nordic Computational Linguistics conferences, and comprehensive links. In Swedish and English.

Speech Prosody at Bell Labs - Current research on non-lexical aspects of speech and paralinguistic communication. General information, downloadable papers, and multilingual text-to-speech demo.

An Algorithmic Approach to English Pluralization - Research paper by D.M. Conway of the School of Computer Science and Software Engineering, Monash University. (1998).

ICoS-1: First Workshop on Inference in Computational Semantics - Start of a series of annual workshops, held in Amsterdam. Background information, schedule, program.

Semiotics of Autonomous Information Systems - Special session at the National Institute of Standards and Technology Joint Conference. Schedule and many papers in various formats.

SIGLEX Workshops - Links to past and future workshops organized by this special interest group of the Association for Computational Linguistics, dedicated to computer-aided lexical and lexicographical research. Includes programs and schedules of events dating back to 1995.

SIGDAT Conferences and Workshops - Programs and schedules of conferences organized by the data and corpus linguistics special interest group of the Association for Computational Linguistics. Includes upcoming conferences and events back to 1995.

SIGPARSE Workshops - Programs and schedules of workshops organized by this special interest group of the Association for Computational Linguistics, dedicated to the advancement of parsing technology.

EACL Conference Proceedings - Searchable proceedings of conferences of the European Association for Computational Linguistics, prior to their integration with ACL conferences. Papers are not available for download.

CICLing - 2000 - First annual conference giving an overview of computational linguistics, held in Mexico City. Pictures, conference information, keynote speakers.

CLIN VII - Dutch computational linguistics conference hosted by IPO Eindhoven. Abstracts, downloads of selected papers.

CLIN 98 - Ninth annual meeting of Computational Linguistics in the Netherlands, held at the University of Leuven. Schedule, general information, abstracts.

CLIN 99 - Tenth annual meeting of Computational Linguistics in the Netherlands, held at the Utrecht Institute of Linguistics. Program, abstracts, registration and sponsor information.

CLIN2000 - Eleventh annual meeting of Computational Linguistics in the Netherlands, held at Tilburg University. Program, registration information, how to buy the proceedings.

ICoS-2: Second Workshop on Inference in Computational Semantics - Held at Schloss Dagestuhl, in Saarland, Germany. Background information, program, schedule.

COLING-ACL 98 - The 17th annual International Conference of Computational Linguistics, held in conjunction with the 36th annual ACL conference in Montreal, Quebec, Canada. List of participants, program.

COLING 2000 - The 19th annual International Conference on Computational Linguistics, held in Saarbrücken, Germany. Participants, program, procedural information.

First Workshop on Computational Semiotics for New Media - Held in Surrey, United Kingdom. Call for participation, program, list of organizers.

CLIN 97 - Meeting of Dutch researchers held in Nijmegen. Program, organizational and sponsor information.

FLAIRS 2003 Special Track on Recent Advances in Natural Language Processing - A dedicated section of this artificial intelligence conference in St. Augustine, Florida, United States. Features call for papers, organizational information, contact details.

CLIN 2001 - Twelfth annual meeting of Computational Linguistics in the Netherlands, held at the University of Twente. Program, abstracts, general information.

COSIGN 2001 - Conference on computational semiotics for games and new media, held in Amsterdam, Netherlands. Call for participation, program, downloadable proceedings in PDF format.

EMNLP 2001 - Conference on empirical methods in natural language processing, sponsored by the ACL's SIGDAT group, held in Pittsburgh, Pennsylvania, United States. Site features participants, schedule, and papers online in PostScript or PDF formats.

NAACL-2001 - Second annual meeting of the ACL's North American chapter, held in Pittsburgh, Pennsylvania, United States. Schedule, program, procedural information.

CICLing - 2001 - The second of these annual meetings dedicated to a general overview of trends in computational linguistics. Held in Mexico City. Pictures, speakers, conference information.

PACLIC 15 - The 15th Pacific Asia Conference on Language, Information and Computation, held in Hong Kong. List of participants, pictures, program, procedural information.

Probability Theory in Linguistics - Workshop held in Washington, DC, by the Linguistic Society of America, covering probabilistic approaches to a number of subfields. Handouts available in PDF format.

SCANALU 2002: The First International Conference on Scalable Natural Language Understanding - A conference endorsed by the Special Interest Group in Computational Semantics, to be held in Heidelberg, Germany. Submission guidelines, important dates, contact information.

COLING 2002 - 19th International Conference on Computational Linguistics, in Taipei, Taiwan. Submission and registration procedures, organizational information, schedules.

CICLing - 2002 - Third of these annual meetings held in Mexico city, covering computer-related aspects of linguistics. Schedule information, submission guidelines, list of approved papers, pictures from past conferences.

First International Wordnet Conference - In Mysore, Karnataka, India, hosted by the Global Wordnet Association. Schedules, information on registration and travel to and around the Mysore area.

ACL-02 - Fortieth annual meeting of the Association for Computational Linguistics, held in Philadelphia, Pennsylvania, United States. Program, general information.

University of Heidelberg Department of Computational Linguistics - Undergraduate and graduate programs focused on Natural Language Processing Systems. Basic information, staff profiles, current projects.

CLaRK - Graduate Programme in Computational Linguistics and Represented Knowledge - The Tuebingen-Sofia International CLaRK Graduate Programme provides a joint teaching and research facility wherein doctoral and master's students from Central and Eastern Europe (CEE) pursue their researches in the interdisciplinary field of computational linguistics and knowledge representation.

Georgetown University Computational Linguistics - A graduate program which specializes in training students for non-academic careers, offering a certification program and master's and doctoral degrees. Online application, admissions and program information, selected research projects.

Göteborg University Program in Computational Linguistics - A Swedish program offering bachelor's and master's degrees. Program information and list of student thesis projects.

Theoretical Computational Linguistics at the University of Stuttgart - German graduate program in theoretical aspects of computational linguistics. Staff homepages, publication archive, program information.

University of Essex Master's Program - Information on admissions and job prospects in the field, as well as detailed course information including syllabi and downloadable course handouts.

Laboratory for the Computational Studies of Language - Research institute at the Middle East Technical University in Ankara, Turkey. Staff profiles, publications archive, tools, and current projects including a corpus of the Turkish language.

University of the Saarland Department of Computational Linguistics and Phonetics - Program information, research projects, publication database, a German-language corpus and discourse interpretation software.

Laboratory for Computational Linguistics at Technion-Israel - An institute with projects focusing on Hebrew corpus linguistics and computational semantics. Downloadable technical reports, staff information.

University of Toronto Research - Focuses on problems in the nuanced representation of linguistic and semantic knowledge. Current areas of research, related university courses, publications archive.

Brown Laboratory for Linguistic Information Processing - Brown University research program. Staff and student profiles, publication archive.

Center for Computational Linguistics - A Czech research center at Charles University, Prague. Dedicated to research based on grammatical analysis of the Czech language corpus.

Lancaster University Centre for Computer Corpus Research on Language - An institute with a long record of corpus-based linguistic research. Information on courses, events, and published works. Also features a web-based public course in corpus linguistics.

Copenhagen Business School - Danish program offering master's and doctoral degrees in the field, with research focusing on the formal description of Danish, knowledge modeling and machine translation. Basic information on staff and current projects.

University of Amsterdam - A Dutch postgraduate program of study. Home pages of research staff, current events, archives.

Brandeis Research Lab for Linguistics and Computation - Program focused on using lexical web techniques for lexically based semantic indexing and content abstraction. Staff profiles with individual publication archives.

University of Potsdam - German university with research and postgraduate education programs in computational linguistics. Information for students, descriptions of current projects.

Korea University - English-language page for this school's program in the computational study of the Korean language. Includes demos staff profiles, descriptions of current projects.

HPSG Grammars in ALE - This document shows how to implement a natural language parser using ALE.

HPSG-L Archive - Web-based archive of this listserv, with files dating back to 1999.

Ohio State University HPSG - Conferences, resources, links.

Center for the Study of Language and Information - Head-Driven Phrase Structure Grammar (HPSG) research at Stanford University. People, research, publications, mailing list, links.

Interactive Bibliography - HTML and BibTex formats.

Overview and Some Work in Progress - Transcript of a 1996 talk by Carl Pollard, detailing the state of HPSG theory at that time.

HPSG Gazette - Online newsletter.

The Babel-System - Implementation of an HPSG for German. Features online interface. Portions in German.

Phrase Structure Grammars for Natural Languages - Project description.

ALE: The Attribute-Logic Engine - A freeware platform for building HPSGs. Current and past versions, documentation, sample grammars and related materials.

HPSG 2001 - Conference organized by the Center for the Study of Language and Information and held in Trondheim, Norway. Schedule, program, abstracts, some downloadable papers.

Electronically available papers of Detmar Meurers - Several HPSG papers, and other related areas.

Carl Pollard - Publication listing, contact information.

Introduction to ALE-RA - A course by Colin Matheson exploring the lexical rules under which the ALE-RA computational morphology tool operates.

HPSG and Grammar Engineering - Links to projects, meetings.

Click here!

---

netmation.com | netmation.net | netmation.org | netmation.tv

Copyright © 1991-2005 Netmation Inc. All Rights Reserved
Site Designed and Hosted by Netmation Inc.