---
Reference - Knowledge Management - Knowledge Discovery
---

model&mine - Dorian Pyle, author of "Data Preparation for Data Mining", provides resources on data mining, business modeling, and analytical CRM, including articles, white papers, downloads, books, information on courses and consulting, extensive links, and FAQs for mining pros and newbies.

UCI Knowledge Discovery in Databases Archive - An online repository of large datasets which encompasses a wide variety of data types, analysis tasks, and application areas. The primary role of this repository is to serve as a benchmark testbed to enable researchers in knowledge discovery and data mining to scale existing and future data analysis algorithms to very large and complex data sets.

KD Nuggets Tool Links - Tools for Data Mining and Knowledge Discovery. Comprehensive list of tools with Internet Links.

Second Moment - The news and business resource for applied analytics. Powerful content weblog mixing articles, commentary, technique and critique of the intersection of academic KD research and the directed KD of corporations.

Information Discovery, Inc. - Knowledge Access Suite. Relevant patterns in data are automatically pre-mined into a pattern warehouse for business users.

Knowmadic Inc. - Provides web automation solutions by 'driving' the browser. Applications in B2B integration, web automation, content aggregation, data warehousing, and data mining browser helpers.

Nonlinear Thinking - A thought process that offers simple solutions within complexity, reducing the dependence on experts, consultants and external resources. Features articles and tips for discovering novel solutions to recurring problems.

Goldridge Strategic Landscapes - Visualize complex market relationships based on real-time market data contained in Goldridge databases to which clients subscribe.

Web Mining for Knowledge Discovery - Firm offers services mining, analyzing and transforming raw data into meaningful information. Features company profile, description of services and contact details.

KD Central - Portal features links to theory, articles, guides, tools, consultants, and companies.

Megaputer Intelligence - Data, text, and web mining software. PolyAnalyst includes in-place mining, strong Microsoft integration.

Bibliography: Data Mining and Knowledge Discovery in Databases - A bibliography of KDD research from a Computer Science perspective. Links are provided to almost every paper, most of which are seminal.

Net Perceptions - Real-time relationship marketing and personalization, integrating high-scale data mining, analytic, and recommendation technologies with a direct conduit to action.

Mining Customer Data - By Gary Saarenvirta. A step-by-step look at a powerful clustering and segmentation methodology.

Knowledge Discovery In Databases: Tools and Techniques - Article by Peggy Wright that presents the results of a literature survey outlining the state-of-the-art in KDD techniques and tools.

Data Mining Group - Data Mining group in the Laboratory for Knowledge Discovery in Databases (KDD) at Kansas State University

The Deep Web: Surfacing Hidden Value - White paper on the Deep Web, an area of the Internet 550 times larger than the surface web crawled by traditional search engines.

Psybertron Knowledge Modelling Weblog - What, Why and How do we Know? Research into models for knowledge management in business organisation decision support. (Supersedes Ian's Knowledge Modelling Weblog.)

ClearForest - ClearForest software examines random facts and converts them into relevant knowledge, drastically reducing research time and increasing the quality of information gathered.

Brosis Innovations Inc. - XCise Pro text mining software. It automates the process of searching for, mapping and reporting both the presence and frequency of key words and phrases.

Readware Technology - Products and toolkits to master the flood of textual information. The 'Surveyor' class of products analyzes masses of text and brings out the most relevant issues classified by the subjects you select. Use Readware to analyze, assess, classify, filter and extract information, and perform advanced search.

Trivium - SEE-K combines unique technologies and methodologies resulting from 9 years of ongoing research and experience in Human Capital and Knowledge Management.

DataSet - Extends the familiar data warehousing paradigm to a wider scope: managing unformatted text from multiple sources (letters, documents, notes, HTML pages) together with data records within a relational database. Advances the power of MS-Word.

Fractal Edge Limited - Fractal:Edge develops software products for navigating large information sources quickly and accurately using a patented visualisation technique based on fractals.

Stratify (formerly Purple Yogi) - Stratify provides enterprise software solutions that allow companies to leverage their vast amount of corporate information to make more effective business decisions and facilitate highly efficient business transactions.

TextAnalyst - Neural Network Natural Language Analysis - TextAnalyst implements a proprietary neural network technology analyzing natural language in several languages. TextAnalyst's functionality includes textbase navigation, natural language querying, semantic analysis, summarization, clustering and classifying and automated knowledge base creation.

WEBSOM - Self-Organizing Maps for Internet Exploration. An ordered map of the information space is provided: similar documents lie near each other on the map. The order helps in finding related documents once any interesting document is found.

The Vantage Point - VantagePoint helps you extract information from any structured text database. It supports patent analysis, investment planning, innovation forecasting, and technology planning.

Thinkmap.com - The Thinkmap Platform increases the sophistication with which organizations communicate over the Internet, transforming a static collection of objects or information into a striking animated display that encourages interaction.

TextSmart 1.0 - TextSmart means fast and complete analysis of open-ended survey responses.

System of Meaning Integrities structural Creation (SMIsC) - SMIsC is a text-mining software tool that enables the user to integrate large unstructured masses of text into browsable networks of local coherence, to uncover in them unexpected patterns of meaning (thematically organized text clusters transformable into discourses or narratives), and to analyze these patterns in their statics and dynamics.

TheBrain - A Dynamic Information System - TheBrain lets you integrate content across applications, document types and internet data, freeing you from application and location-centric computing. TheBrain stores information associatively and interoperates with standard Windows applications, so you don't have to think about launching programs or digging through files and folders. In TheBrain ideas come first. File formats become incidental and physical locations become transparent, giving you a context rich workspace completely native to your way of thinking. Powerful new metaphor, useful for personal or site information navigation.

Antarcti.ca Systems Inc. - Antarcti.ca transforms networks into places, with shared information landscapes that resemble physical geographies. Demos include "map display" of search results.

Enfish - Information integration software incorporating desktop data and corporate knowledge.

Research Outlet and Integration - twURL: filter, organize, and publish web topics for competitive intelligence, due diligence, libraries, news background, and search tracking. twURL's visual decision support interface helps turn raw, unorganized collections of thousands of URLs on a topic into selected, categorized, and reusable HTML files and databases.

Virtual Self - Virtual Self makes possible a real-time, taxonomy-free, flexible, state-of-the-art system that gets you to the answer you are seeking more quickly, provides access to all your information regardless of where it is, and leverages your existing IT infrastructure.

Maps of the Web: visual Internet search directory, ODP, chat, bookmarks - Map.net is the first visual web search engine / directory - a map of the Internet providing intuitive, fun navigation and relevant search results.

RecomMind MindServer - RecomMind (Berkeley, CA) makes the MindServer knowledge management and search software for web sites and intranets. MindServer provides personalized search results and automatically deals with ambiguous queries in even the most specialized document collections.

IBM Intelligent Miner for Text - Offers a wide range of text-analysis tools, full-text retrieval components, and Web-access tools to enrich business-intelligence solutions. Suited to analysis of email, insurance claims, news feeds, Lotus Notes, patent portfolios, customer complaint letters, and even competitors' web pages.

Knowledge Base - Provides knowledge base and knowledge management software, focusing on web-based self help solutions.

Yellowbrix - YellowBrix is an industry leader in providing companies with content infrastructure services to manage, retrieve and distribute information from multiple sources across all distribution channels. Our services include syndicated news as well as superior content management built on sophisticated technology.

iCrossReader™ - A powerful text mining tool. Produces an on-demand survey capable of refocusing the user's attention on the new aspects of a given theme.

MapStan.net: Your navigation plan based on internet users' experience - MapStan.net is the only service that supplies you with a personalized plan of your navigation based on the experience of Internet users.

Torch Concepts: Advanced technology for content management and information mining - Torch Concepts' unique algorithm-based products allow organizations and businesses to find and organize critical information located in internal corporate databases, corporate intranets, and on the WWW.

LexiQuest, Inc. - LexiQuest lets you discover, manage and retrieve text-based information by using advanced natural language technology. Intuitive knowledge management tools empower your employees and customers with quick, precise results to their research needs.

TEMIS - Text Mining Technology and Consulting company offering software components ("Insight Discoverer") for the efficient analysis of large document collections: information extraction, classification and clustering.

Igentica Ltd. - Provides software that searches, mines and integrates information for e-commerce and other applications. Features demonstration software, product descriptions, and contact details.

KnowledgeSite, Inc. - Provides technology for automatic summarization of electronic documents and metadata creation. Features product overview, demonstration, and contact information.

Datawatch Monarch - A data access and analysis tool that uses spooled report files as a source for data. Features information on the product and sample data modules.

NetMap Analytics - A text-mining and fraud detection system for use within the Insurance and Retail industries. Visualisation, Clustering, Deviation Detection and Link analysis.

myKB Hosted Knowledge Base - Knowledgebase software that easily integrates into websites. Customers can quickly access company support information.

MinePoint Software - MinePoint is a modular software solution designed to provide intelligent data analysis capabilities for data residing in disparate information systems.

Weka 3 - Collection of machine learning algorithms for solving data mining problems implemented in Java and open sourced under the GPL.

About the Cat-a-Cone - A novel user interface that integrates search and browsing of very large category hierarchies with their associated text collections. One key insight is the separation of the representation of category labels from documents, which allows the display of multiple categories per document.

About Scatter/Gather - The Scatter/Gather interface uses text clustering as a way to group documents according to the overall similarities in their content. Scatter/Gather is so named because it allows the user to scatter documents into clusters, or groups, then gather a subset of these groups and re-scatter them to form new groups.

About TileBars - To many users, the way search engines choose to rank retrieved documents is a bit of a mystery. The TileBars interface is an attempt to show the user, graphically, the relationship between the words in the query and the documents retrieved.

Temis - Consulting and proprietary software (Insight Discoverer, Skill Cartridge) for text mining. Strong in Germany.

Eidetica - hosted knowledge - World-class Professional Search Systems. Index and Subject Inference Hosting. Advanced Text Mining Services.

Defining Data Visualization - Don Nachtwey - Thinx Software. Data Visualization is an emerging market space that is not well defined. Data Visualization is a breed of products that feature a graphic component and a data component. But as a consumer with a data visualization requirement, I would have trouble doing a product comparison.

Automated Analysis of Natural Language Texts - Megaputer white paper about popular text analysis methods.

Text Mining and the Knowledge Management Space - A Semio Corporation white paper.

Leximancer Text Mapping and Exploration - Bayesian-based technology for mapping and mining concepts in large text collections. Builds concept maps/thesauruses and classifies text from most languages. Displays the text map on a Java globe.

Untangling Text Data Mining - Defines data mining, information access, and corpus-based computational linguistics, and then discusses the relationship of these to text data mining. The intent behind these contrasts is to draw attention to exciting new kinds of problems for computational linguists.

Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text - Describes the design, prototyping and evaluation of ARC, a system for automatically compiling a list of authoritative Web resources on any (sufficiently broad) topic. The goal of ARC is to compile resource lists similar to those provided by Yahoo! or Infoseek.

Text Mining at Waikato - The Text Mining group at the University of Waikato in New Zealand. With a focus on Viterbi search and entropy-based methods the group has a compression feel to it.

TextAnalyst - TextAnalyst is a unique text mining tool, using a semantic network for retrieval, clustering, classification, summarization, and natural language querying.

Quenza from Xanalys - Automatic extraction of entities and cross references from text.

David Small: Shakespeare Project - The difficulty seems to be, not so much that we publish unduly in view of the extent and variety of present-day interests, but rather that publication has been extended far beyond our present ability to make real use of the record.

Weka - An open source framework for text analysis implemented in Java that is being developed at the University of Waikato in New Zealand. Features product overview, developer notes and mailing list.

WordStat content analysis & text mining software - Contains information and trial version of WordStat, a content analysis & text data mining tool.

TextAI: Text Analysis International - Provides NLP applications based on its proprietary VisualText technology. Product and service information, online software tour, and documentation.

A Roadmap to Text Mining and Web Mining - A comprehensive list of text and Web mining related resources including links to working groups, products, conferences, and workshops.

IBM Intelligent Miner for Text - Offers a wide range of tools for text analysis.

Extraction of knowledge from unstructured text - A comprehensive and annotated survey of knowledge extraction from text, in the form of PowerPoint slides (PDF).

WebAnalyst - Profiles the content of a web page, or from a content database, and uses data mining techniques to associate profiled content dynamically during a browsing session.

Machine Learning in Automated Text Categorization - Survey discussing the main approaches to text categorization that fall within the machine learning paradigm. By Fabrizio Sebastiani. [PDF Format].

Pertinence - Automatic text summarization tools.

text mining and web-based information retrieval reference - List of about 100 links reflecting reviews and analyses of text mining research, academic and commercial. A good list but not annotated or categorized.

Text mining, Web Mining - Related Conference & Workshops - Provides a list of the upcoming conferences and workshops related to text mining and web mining, with submission deadlines and links to home pages.

Text Mining Community - Provides a web home for people interested in text mining related technologies, with a mailing list and a resources section.

Knowledge discovery and data mining: theory and practice - M.A. Bramer (Ed.) - Full Contents.

Advances in Knowledge Discovery and Data Mining - book

Military Books - The Military and Aviation Book Society: whether your interest be aviation, the armed service, military history or memorabilia we bring you a wealth of reading.

PenguinsRule - A site working on solutions to help animal rescue after oil-spills.

ACM Special Interest Group on Knowledge Discovery in Data and Data Mining (SIGKDD) - SIGKDD encourages basic research in KDD (through annual research conferences, newsletter and other related activities), adoption of "standards" in the market in terms of terminology, evaluation, methodology, and interdisciplinary education among KDD researchers, practitioners, and users.

InterActive Software Concepts, Inc. - Information searches, data mining and knowledge discovery contracts using the company's own Internet software and techniques.

Plumb Design - Creates online experiences that facilitate the exchange of knowledge and the interplay of ideas.

Engenium Corporation - Provides software based on "conceptual matching technology" for human resources and other applications. Executive profiles, press releases, product and service information.

InfoViz - Document finder. The research group "Visualization of Dataspaces" was established in October 1996 at the University of Applied Sciences in Potsdam, Germany.

Dynamic queries, starfield displays, and the path to Spotfire - Ben Shneiderman. The old days of command line interfaces and submitting queries to databases are passing quickly. In their place are dynamic queries and starfield displays that update a two-dimensional graphical display in 100 milliseconds.

A Combined Visualization Approach for WWW-Search Results - The idea of Information Visualization is to get insights into great amounts of abstract data. Especially document sets found by searching the World Wide Web are a special challenge.

A Study of Visualisation Tools for the Web - Finding things on the web is a problem that grows with the web itself. Automated classification and clustering techniques are needed to fully exploit the benefits of both directories and search engines.

Information Visualization - Details about technologies in information visualization at the Pacific Northwest National Laboratory. Includes graphics and published papers.

The SAGE Visualization Group - Explanations of the SAGE, SDM, Visage, Autobrief and VQE systems and their applications.

OLIVE: On-line Library of Information Visualization Environments - Includes Temporal, 1-D, 2-D, 3-D, Multi-D, Tree, Network, and Workspace environments. Many links to ongoing projects and papers.

Information Visualization - Graph Visualization. Latour Tree Visualization Project. Tree and Skeletal Images.

Graph Visualisation and Navigation in Information Visualisation - This is a survey on graph visualisation and navigation techniques, as used in information visualisation.

Mappa.Mundi Magazine - Mappa.Mundi Magazine explores how we see and use the Internet via an eclectic mix of articles about technology, history, and the future of cyberspace. An interesting magazine, worth a look.

Populated Information Terrains - This page contains information on some of the visualisation techniques being employed by the Communications Research Group, Department of Computer Science, University of Nottingham, UK.

Ceetron - understanding by visualization - Provider of 3D Visualization and Image Processing Software Solutions

InfoVis.Net - A bilingual (English/Español) website devoted to Information Visualization, its techniques, who's who, resources, bibliography and a collaborative space for contributing to this rapidly evolving topic.

Xerox PARC UIR Information Visualization - Information Visualization is the use of computer-supported interactive visual representations of abstract data to amplify cognition. Whereas scientific visualization usually starts with a natural physical representation, Information Visualization applies visual processing to abstract information.

Jerry Isdale's Big List of InfoVis Links - Academic studies, events, government activities, journal articles, links and products.

Information Visualization via Hyperbolic Geometry - We visualize the structure of sections of the World Wide Web by constructing graphical representations in 3D hyperbolic space. The felicitous property that hyperbolic space has "more room" than Euclidean space allows more information to be seen amid less clutter, and motion by hyperbolic isometries provides for mathematically elegant navigation.

As You Like It: Tailorable Information Visualization - Information visualization tools have traditionally implemented a set of pre-defined visual displays. We describe the DOODLE Visualization Tool, which is interactive and supports visualizations specified by the user with a visual constraint-based language.

PaVIS: Proximity Visualization of Abstract Data - Website devoted to visualization of abstract data collections, like graphs, multivariate data tables, or sets of multimedia objects. Icons representing objects from a collection are positioned such that proximity relationships within the collection are preserved, i.e. icons for similar objects are clustered, and separated from the dissimilar ones. Examples used include MDS (multidimensional scaling).

Gary Ng's Information Visualisation Resources site - Extensive and current directory of sites relating to information visualisation.

Tim Cribbin's Information Visualisation page - Comprehensive resource for researchers and students interested in information visualisation and exploration. The site focuses mainly on HCI issues and associated research being conducted by the Vivid research group, Brunel University.

Welcome to WebMap - WebMap Technologies Inc.: a software company providing breakthrough eBusiness solutions for visual interaction with information, focusing on map-oriented display of data search results.

SPIRE at PNNL - SPIRE (Spatial Paradigm for Information Retrieval and Exploration) provides a wealth of tools for exploring information, including query, subset, and trend analysis tools.

The Vivid Research Centre - The Vivid group brings together researchers from related fields in order to pursue high quality research in the area of human-computer interaction.

Maya Viz - Maya Viz's Katalyst software supports and enables users in Decision Communities to capitalize on the value of shared visualizations of data.

Antarcti.ca Systems: Visual Mapping Technology - Antarcti.ca's data visualization software improves the search and navigation of databases and taxonomies. By making network data visual, like desktop data, the value of your information is increased.

Verona - Verona is a visual knowledge management tool designed to support unstructured information and collaboration.

Visualizing Network Information and Graphs - Describes the NicheWorks prototype interface, a visualization tool for large data sets. Includes papers on graphing and exploratory data analysis.

Semtation - Publisher of SemTalk, a tool that attaches a graphical model to a text document. Provides an overview of the product, papers, news, and a free trial version.

VVI - Realtime, Web And Bulk Data Reporting And Visualization Solutions

NetVis Module - An open source web-based tool for researchers to simulate, analyze, and visualize social networks using data from online surveys, imported CSV files, and electronic discussion groups.

OpenDX - Official site of OpenDX, the Open Source Software Project based on IBM's Data Explorer. News, downloads, add-ons, support.

Official Home of the GRASS GIS - GRASS GIS (Geographic Resources Analysis Support System) is an Open Source/Free Software Geographical Information System (GIS) that operates on various platforms through a graphical user interface and shell in X-Windows.

Visualization in Scientific Computing - Information about scientific visualization and the use of IBM Data Explorer.

SemTalk - Graphical editor for the Semantic Web. Attach a model to a text document as a graphical summary of its content.

Macrofocus :: Interactive Visualization - Development of interactive visualization systems that enable better-informed decisions, and support the generation and communication of knowledge.

Data Mining and Knowledge Discovery - The premier technical journal focused on the theory, techniques and practice for extracting information from large databases. Available electronically via Kluwer Online.

KDNuggets: Data Mining and Knowledge Discovery - A free electronic newsletter on data mining and knowledge discovery topics.

SIGKDD Explorations - Newsletter of a special interest group of the Association for Computing Machinery.

---

Copyright © 1991-2005 Netmation Inc. All Rights Reserved
Site Designed and Hosted by Netmation Inc.