Machine Learning Resources

Suggestions welcome

Applications | Bibliographies | Books | Commercial | Conference Announcements | Courses | Data Repositories | Home Pages | Information | Jobs | Job Seekers & Headhunters | Journals & Special Issues | Mailing Lists | Mirrors | ML Search Engines | Publication Search Engines | Research Groups | Software | Special Interest Groups | Tutorials

Applications (related to ML)
* Browsers/Assistants (i.e., WWW-Learning)
* InfoFinder (Andersen Consulting)
* The Informant (Dartmouth)
* Java Meta-Learning (Sal Stolfo)
* Karnak
* News Dude (UC Irvine)
* Syskill & Webert (UC Irvine)
* WebWatcher (CMU)
* Firefly
  + My Yahoo!
List of Published ML Applications (outdated) (David W. Aha)
Software Reuse
* Building Programs from Data Processing Examples (Andrew Gleibman, Microspec Technologies Ltd)
Workshops
* Report on the MLnet Workshop on Industrial Applications of Machine Learning (ILWS'94; Yves Kodratoff)
* ICML-95 Workshop: Applying ML in Practice
* ICML-97 Workshop (Engels, Verdenius, Evans & Herrmans)
* ICML-98 Workshop on The Methodology of Applying Machine Learning: Problem Definition, Task Decomposition and Technique Selection
  + Workshop Summary

Bibliographies (surely this is incomplete!)
* Automated Text Categorization (Fabrizio Sebastiani)
* Baldwin effect (Turney, Whitley, & Anderson)
* Bias Shift (Peter Turney)
* Class Imbalance Problem (Nathalie Japkowicz)
* Combining Estimators (Tirthanker Ray Chadhuri)
* Computer Science Bibliographies
* Computational Learning Theory
* Constructive Induction/Feature Engineering (Tom Fawcett)
* Learning Classifier Systems (Tim Kovacs)
* Conceptual Clustering (Luis Talavera)
* Context-Sensitive Learning (Peter Turney)
* Cost-Sensitive Learning (Peter Turney)
* Data Mining
  + Temporal, Spatial and Spatio-Temporal Data Mining Bibliography (Roddick & Spiliopoulou)
* Decision Trees
  + Recursive Partitioning Algorithms (Tjen-Sien Lim)
* Feature Selection (Peter Turney)
* KDD and Data Mining (Andy Pryke)
* Lazy Learning: A Living Annotated Bibliography (David W. Aha)
* Machine Learning (Peter Turney)
* Machine Learning Applied to Information Retrieval (Peter Turney)
* Machine Learning in Strategic Game Playing (Johannes Fürnkranz)
* A Machine Learning Bibliography (University of Dortmund)
* Bibliography of Machine Learning (University of Manitoba)
* Neural Networks
  + Data Exploration Using Self-Organizing Maps (Dissertation, Sami Kaski)
  + Rule extraction from artificial neural nets (Andrews, Diederich, & Giles)
  + Symbolic Neural Networks (Olcay Boz)
* Receiver Operating Characteristic (ROC) Literature (Kelly Zou)
* Small Disjuncts (Gary Weiss)
* Aha's Modest lists of references
  + Decision Trees
  + Feature Selection
  + Lazy Learning
    o Feature weighting (i.e., for similarity functions)
    o Fast implementations
    o Formal Analyses
    o Function Learning
    o Instance-averaging
    o Integration: Genetic algorithms
    o Integration: Rules
    o Learning prototypes
    o Storage reduction

Books

Commercial
* Abtech Corporation (ModelQuest statistical/neural network tool)
* AIM Learning Systems (e.g., Genetic Programming software)
* ALTA Analytics (NETMAP)
* Knowledge Technologies of Andersen Consulting (Data Mining Consulting Services)
* Angoss
  International Limited (KnowledgeSEEKER, KnowledgeSTUDIO)
* Assistum (Assistum KM tool suite)
* Attar Software (XpertRule Miner, Profiler)
* Belmont Research: CrossGraphs
* Blue Martini Software
* J.P. Brown Ultimate Resources, Inc. (SuperInduction)
* Cirrus Recognition (CirrusNet neural network technology)
* CognIT a.s (Makers of KM tools)
* Data Distilleries
* DAZ Systems Incorporated (Data-Mining and KDD Products)
* DBMiner Technology Inc. (DBMiner)
* Integral Decisions Systems SE GmbH (Datamining vendors & consultants)
* Information Discovery, Inc. (Information Discovery System, Customer Retail News, Intra-Knowledge, etc)
* InfoWare (makers of STATlab)
* Integral Solutions, Ltd. (Clementine)
* Intelligent Application (Edinburgh, Scotland)
* IBM
  + Quest (Data Mining Project)
  + Advanced Scout (Data Miner; see the NBA application)
* iDetect Software (makers of MINDSuite)
* International Knowledge Discovery Institute
* ISoft
  + Makers of ALICE
* Jurik Research Software (provides data preprocessing software for neural nets)
* Knowledge Industries
* Megaputer Intelligence, Ltd. (PolyAnalyst)
* Management Intelligenter Technologien GmbH
* NeoVista Solutions, Inc.
* Net Perceptions (Makers of personalization software)
* Neural Technologies
* Partek Incorporated (Makers of Partek Pro 2000, a comprehensive data pattern recognition and data visualization system)
* Prediction Company
* Q-Why
* Reduct & Lobbe, Inc (makers of DataLogic)
* Renascence Partners
  + Easynet Neural Net software package
* RuleQuest Research (Makers of Cubist; J. Ross Quinlan, CEO)
* Saffron Technology
* Salford Systems (makers of CART software)
* Script Software International (KnowledgeMiner)
* Silicon Graphics
  + MineSet
* Semio Corporation (Makers of SemioMap)
* Smart Information Services GmbH (intelligent information retrieval services)
* Tandem Computers Inc.
  (makers of an object relational data mining architecture)
* TRIADA (Ngram technology)
* Ultragem (Using GA's)
* *virtuak (inhesion ML tools)
* WhizBang! Labs
  + WhizBang! Labs Research
* WizSoft, Inc. (WizRule)
* ZDM Scientific Ltd.

Conference Announcements

Courses on Machine Learning

Data Repositories
* Bilkent University Function Approximation Repository
* CMU Artificial Intelligence Repository
  + Machine Learning
* GMD Repository
* National Space Science Data Center (e.g., NASA's datasets)
* ILP Applications and Datasets (Oxford University)
* RISE: ISI Repository of Test Domains for Learning Extraction Rules
* Datasets at Silicon Graphics
  + Census Income
  + Census Year
* StatLib
* StatLog
* DatGen: A Program for Creating Classification Datasets (formerly SCDS, Gabor Melli)
* UCI Repository
  + Reference as: Blake, C., & Merz, C.J. (1998). UCI Repository of machine learning databases [www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California, Department of Information and Computer Science.
* Unipen
* AI Planning Domain Repository (University of Maryland)
* University of Nevada at Reno

Home Pages
* Global Web Site for ML and CBR Researchers (U. Ottawa)
* ML/CBR Home Pages (David W.
  Aha)
* Bayesian Statisticians Home Pages
* Data Miners (Andy Pryke)
* Programming by Example (Henry Lieberman)
* SIGNLL Members

Information

Applications | Artificial Life | Belief Nets | Competitions | COLT | Design | Ensembles and Mixture Models | Evaluation | Fraud Detection | Fuzzy Logic | Games | General ML Resources Pages | Genetic Algorithms | Human-Computer Learning | Inductive Logic Programming | Information Retrieval | Knowledge Acquisition | KDD | Knowledge Engineering | Lazy Learning | Link Analysis | Local Learning | Manufacturing | Minimum Description & Message Length | Multi-Agent Learning | Natural Language | Neural Networks | Ontologies | Planning | Robots | Rough Sets | Skill Acquisition | Support Vector Machines | Temporal Processing | Text Processing | Unsupervised Learning

* Applications
  + Applications in Machine Learning (Asa Rudstrom)
* Artificial Life
  + Report on the ALife V Conference
  + SIM_AGENT Toolkit (U. Birmingham)
* Belief Nets
  + Bayesian Knowledge Discovery Project (The Open University)
  + Bayes Nets (Kevin Murphy)
* Competitions
  + Abbadingo One: DFA Learning Competition (Lang and Pearlmutter)
  + New East-West Challenge: Results (Michie, Muggleton, Page, & Srinivasan)
  + The Learning Machine Challenge (Announced at IJCAI'01)
* Computational Learning Theory
  + Computational Learning Theory (Nicolo Cesa-Bianchi)
  + COLT Bibliography (U.
    Maryland)
* Design
  + ML in Engineering Design and Manufacturing (Yoram Reich)
  + 1996 AI & Design Conference Workshop on ML in Design (Brown, Goel, and Duffy)
* Ensembles and Mixture Models
  + TR: Additive Logistic Regression: a Statistical View of Boosting (8/98: Friedman, Hastie, & Tibshirani)
  + Ensembles of Classifiers and Guide to Internet Resources
  + Mixture Models (David Dowe)
  + Classification and Regression using Mixtures of Experts (Steve Waterhouse, PhD dissertation)
  + Classification and Regression by Combining Models (dissertation by Chris Merz)
    o BibTeX file of Chris's references
* Evaluation
  + Evaluation of Intelligent Systems (An on-line resource)
* Fraud Detection
  + AI and Fraud Detection (Jörn Dinkla)
* Fuzzy Logic
  + Links (Bjarne Hansen)
* Games
  + Bibliography on ML in Strategic Game Playing (Johannes Fuernkranz)
  + Machine Learning in Games (Jay Scott)
  + First Tournament of Learning to Play Games
* General ML Resources Pages
  + Miningco.com's AI Site
  + AI Subject Index on ML (CNRC)
  + AI/ML Resources (Andy Pryke)
  + Learning Classifier Systems (Alwyn Barry)
  + Machine Learning/Data Mining Repository (Center for Advanced Studies, University of Louisiana, Lafayette)
  + IFIP/SWI Machine Learning Pages
  + Information Services (Austrian Research Institute for AI)
  + Intelligent Software Products (SAIC G6G)
  + Laksberg's Machine Learning Archive (Artur Laksberg)
  + Machine Learning Resources (Paul Nielsen)
  + MLnet Machine Learning Archive (GMD)
  + Other WWW Pages of Interest (Chuck Anderson)
  + Digital Library at U.
    Macedonia
  + Science: CS: AI: ML (Yahoo)
* Genetic Algorithms
  + Genetic Algorithms Archive (NRL)
  + Genetic Programming (Jaime Fernandez)
  + Nova Genetica
* Human-Computer Learning
  + Human-Computer Learning Foundation
* Inductive Logic Programming
  + ILPnet2
    o ILP2 Newsletter
  + Introduction to ILP (Oxford University)
  + ILP for KDD Workshop at ICML-96
  + Top-Down Induction of First Order Logical Decision Trees (PhD dissertation, Hendrik Blockeel)
* Information Retrieval
  + Bibliography on Applying ML to IR Tasks (Peter Turney, previously Rik Belew and Jude Shavlik)
* Knowledge Acquisition
  + Brian Gaines's KAW Page
* Knowledge Discovery in Databases (e.g., Data Mining)
  + Comparisons
    o Comparison of DM Tools (Ovum)
    o Elder Research Website (Contains notes on KDD-98 comparisons)
    o KDD-Cup-98: 2nd International Data Mining Tools Competition (Parsa and Howes)
  + Corporate KDD Bookmark (Thierry Van de Merckt)
  + Cross-Industry Standard Process Model for Data Mining
  + Data Mining Server (Rudjer Boskovic Institute; An internet service for online data analysis based on ILLM)
  + Databases & Logic Programming (Michael Ley)
  + The Data Mine (Andy Pryke)
  + Debunking Data Mining Myths (Robert D. Small, InformationWeek article)
  + Knowledge Discovery Mine (Gregory Piatetsky-Shapiro)
  + Predictive Toxicology Evaluation Challenge (Srinivasan & King)
  + Terminology, Data Mining (Willie Kloesgen & Jan Zytkow)
  + Vanderbilt University
  + VLDB Home Page
* Knowledge Engineering
  + Knowledge Engineering Methods and Languages
* Lazy Learning
  + Prototype Selection for Composite Nearest Neighbor Classifiers (David Skalak, PhD dissertation)
  + Bontempi G., Birattari M., Bersini H. (1999). Lazy learning for modeling and control design, International Journal of Control, 72, 7/8, 643-658. (Gianluca believes that this is the first article in an automatic control journal with the keyword lazy)
  + Lazy Learning Special Issue in AI Review (Ed.: D.W.
    Aha)
* Link Analysis
  + 1998 AAAI Fall Symposium on AI & Link Analysis
* Local Learning
  + Georgia Tech (Chris Atkeson)
* Manufacturing
  + The AUTON Project (Andrew Moore and colleagues)
* Minimum Description & Message Length
  + Minimum Description Length (MDL) and Minimum Message Length (MML) (David Dowe)
* Multi-Agent Learning
  + Learning in Multi-Agent Systems Webliography
  + Adaptation and Learning in Multi-Agent Systems (book; Weiss & Sen, Eds.)
* Natural Language
  + AAAI 1998 Symposium on Applying ML to Discourse Processing
  + CIRCSIM-Tutor Project (Illinois Institute of Technology)
* Neural Networks
  + Announcements of Conferences, Workshops, and Other Events on Neural Networks, Vision, and Speech
  + NN Benchmarking (Lutz Prechelt)
  + Searle's terrific summary relating neural nets with other approaches
  + Hybrid Connectionist-Symbolic Models: A Report from the IJCAI'95 Workshop (Ron Sun)
* Ontologies
  + Ontology Information (Doug Skuce)
* Planning
  + Learning Procedural Planning Knowledge in Complex Environments (11/96 dissertation by Douglas J. Pearson, U. Michigan)
  + Planning Resources (Rob St. Amant)
  + UMCP Hierarchical Task Network Planning Algorithm
  + Planning Digest
* Robots
  + Xavier (CMU)
* Rough Sets
  + Electronic Bulletin of the Rough Set Community
* Skill Acquisition
  + Skill Acquisition Info Page (University of Karlsruhe)
* Support Vector Machines
  + An Introduction to Support Vector Machines (N. Cristianini & J. Shawe-Taylor)
* Temporal Processing
  + Dissertation: Refinement of Temporal Constraints in an Event Recognition System using Small Datasets (George Paliouras, U.
    Manchester, 1997)
* Text Processing
  + Dissertation: Machine learning on large, non-homogeneous text data (Dunja Mladenic, 3/99)
* Unsupervised Learning
  + Computer Journal, Special Issue on Clustering and Classification (1999)
  + Unsupervised learning, clustering and mixture models (David Dowe)

Job Listings

Job Seekers

Journals & Special Issues

Mailing Lists
* AI & KBS for Nondestructive Testing
  + Subscription (http://www.listbot.com/subscribe/DESKPACK)
* AI & Statistics Mailing List
* Classifier Systems (John Holmes, U. Penn)
* datamine-l (Data Mining/Knowledge Discovery/Data Warehousing)
* European Mailing List on Learning Robots (Volker Klingspor)
* Hybrid Models - Learning and Architectures (Vasant Honavar): To subscribe.
* Inductive Logic Programming Network
* Knowledge Acquisition (Kaw)
* Knowledge Discovery Mine
* ML-Net Mailing List (see the ML-Net II home page)
* Uncertainty in AI
* ML-List
* Machine Learning for User Modeling

Mirrors
* UK: CCP14 Auto-Mirrored Web Sites

ML Search Engines
* Biblio (Austrian AI Institute; Publications)
* Cora (Just Research, Inc.)
* ML Papers (Andrew Ng)
* ML Search (Andrew Ng and Dayne Freitag: WWW Pages)
* Neuroscience (Fred Lenherr)

Publication Search Engines
* Biblio: Search for a publication (Austrian AI Institute)
* Database Bibliographies (Mohan Kamath, U. Massachusetts)
* Unified Computer Science TR Search (Marc VanHeyningen, U.
  Indiana)

Research Groups
* Australia
  + Australian National University
  + CSIRO (Gov't research organization)
  + University of New South Wales
  + University of Sydney
* Austria
  + Austrian Research Institute for AI (OFAI)
* Belgium
  + Katholieke Universiteit Leuven
* Canada
  + Canadian National Research Council
  + University of Ottawa
  + University of Waterloo
* Croatia
  + Rudjer Boskovic Institute
    o Data Mining Server (An internet service for online data analysis based on ILLM)
* France
  + Université Paris Sud
  + Université Pierre et Marie Curie (ACASA)
* Germany
  + Aachen University of Technology
  + University of Dortmund
  + German National Research Center for Information Technology
  + Goethe University
    o Laboratory for Data Mining
  + University of Kaiserslautern
    o Learning Laboratory
    o Autonomous Mobile Robots
  + University of Karlsruhe
* Israel
  + The Hebrew University
* Italy
  + Machine Learning and Neural Networks Group (Joint Group involving the Universities of Florence, Siena, and Pisa)
  + University of Torino
* New Zealand
  + University of Waikato
* Portugal
  + University of Porto
* Slovenia
  + Jozef Stefan Institute
* Spain
  + Spanish Scientific Research Council
* Switzerland
  + Dalle Molle Institute for Perceptual AI (IDIAP)
* Turkey
  + Bilkent University
* United Kingdom
  + Robert Gordon University
  + Oxford University
  + Turing Institute
  + University of Bristol
  + University College London (Gatsby Computational Neuroscience Unit)
  + University of the West of England
    o Evolutionary Computing Group
    o M.S.
      in Machine Learning and Adaptive Computing
* United States
  + Brigham Young University
  + Carnegie Mellon University
    o AUTON: Autonomous Machine Learning Lab (Andrew Moore and colleagues)
    o Learning Laboratory
    o Learning Robots Laboratory
    o Reinforcement Learning
  + Colorado State University
  + General Electric Research and Development Center (Information Technology Laboratory)
  + George Mason University
  + Georgia Institute of Technology
  + IBM Almaden Research Center (Data Mining Project)
    o Data Mining Home Page
  + Institute for the Learning Sciences
  + Iowa State University
  + JPL AI Group
  + Kansas State University (Laboratory for KDD)
  + Massachusetts Institute of Technology
    o Center for Biological and Computational Learning
    o Programming by Example
  + Naval Research Laboratory
  + NEC Research Institute
  + Rutgers University
  + Soar
    o The SOAR Songbook
  + United States Air Force Academy
  + University of California at Irvine
  + University of California at Santa Cruz
  + University of Illinois at Urbana-Champaign
    o Inductive Learning
    o Explanation-Based Learning
  + University of Massachusetts
    o Adaptive Networks Lab
    o ML Lab
  + University of New Mexico (Adaptive Computation Group)
  + University of Southern Florida
  + University of Texas at Austin
  + University of Wisconsin at Madison
  + Vanderbilt University

Software (see also Information)
* Belief Networks and Other Probabilistic Systems
  + BAYDA (Henry Tirri, U.
    Helsinki Complex Systems Computation Group)
  + Bayesian Knowledge Discoverer (Marco Ramoni)
  + Belief Networks (Russell Almond)
  + Belief Network Power Constructor (Jie Cheng)
    o USA Mirror Site
  + BETSY: Bayesian Essay Test Scoring sYstem (Larry Rudner, University of Maryland)
  + Hugin (BBN)
  + Microsoft Bayesian Networks Modeling Tool (MSBN; network construction & evaluation)
  + Netica (Norsys)
  + RoC (Robust Bayesian Classifier) (Marco Ramoni, 3/99)
* Evaluation of Multiple Systems
  + ROC Convex Hull for Comparing Classifiers (Tom Fawcett, Foster Provost)
* Clustering: See Unsupervised Learning
* Collections (Varied) of Programs
  + Data Mining Siftware
  + MLT Toolbox (GMD)
  + UCI ML programs (FOCL, Occam, etc)
  + Siftware at the Knowledge Discovery Mine (Gregory Piatetsky-Shapiro)
  + ML Research Software (University of Texas at Austin)
  + Peter Clark - ML Software
  + Prolog Library of ML Programs (Thomas Hoppe, GMD)
  + TOOLDIAG (U. Lisbon & UNINOVA)
* Data Generators
  + Database Generator (Gabor Melli)
* Data Mining
  + SGI's MineSet
  + Siftware at the Knowledge Discovery Mine
  + Agent-Based Data Mining (Sal Stolfo)
  + DBPredictor on-line Classification System (Gabor Melli, Simon Fraser University)
  + SUBDUE (U. Texas @ Arlington)
* Decision Trees (TDIDT) and Rule Inducers
  + AQnn (Michalski and colleagues): What does "AQ" denote? Answer: "AQ or Aq (q as a superscript) stands for Algorithm Quasi-optimal (as originally developed in 1969, the algorithm creates approximately or strictly optimal solutions of the general covering problem (which subsumes learning rules from examples))." - R. Michalski
  + C4.5 and its extensions
    o C4.5's FTP Directory (J.
      Ross Quinlan)
    o Parallel C4.5 (Bin Li)
    o C5.0 info and demonstration versions
    o C4.5 Patch File (for outputting classification rules as Prolog statements) (Kamran Karimi, University of Regina)
    o RFCT: An unsupervised, temporal extension of C4.5 (Kamran Karimi, University of Regina)
  + CN2 (Peter Clark)
  + CN2 Integrated with Qualitative Modelling (Peter Clark)
  + Incremental Decision Trees (Paul Utgoff)
  + ILA-2 (Eastern Mediterranean University)
  + OC1 (Johns Hopkins University)
  + SIPINA (D.A. Zighed)
* Explanation-Based Learning
  + Lazy Partial Evaluation (Peter Clark)
* First-Order Representations (e.g., Inductive Logic Programming)
  + ACCEL (University of Texas at Austin; first-order abductive construction of explanations)
  + Claudien (Luc Dehaspe, Luc De Raedt, & Wim Van Laer)
  + CRUSTACEAN (David W. Aha)
  + FDEP (induces functional dependencies from relations) (Iztok Savnik)
  + FOIL (J. Ross Quinlan)
  + GMD's ILP Software Repository
  + FORTE (University of Texas at Austin; first-order theory revision)
* Genetic Algorithms
  + Genetic Algorithms source code (NRL's GA Archive)
  + REGAL 3.0 (University of Torino)
  + Automatic Design of Algorithms Through Evolution (ADATE)
  + Swarm (Santa Fe Institute)
* Lazy Learning
  + IBL (David W. Aha)
  + NGE (David W. Aha)
  + PEBLS (Johns Hopkins University)
  + Protos (Bruce Porter)
  + TiMBL (ILK, Tilburg: a suite of tools)
    o TiMBL Reference Guide
* Knowledge Acquisition
  + Mobal (GMD)
* Neural Networks
  + Bayesian Learning for Neural Networks (Radford Neal)
  + Fahlman's Neural Networks
  + A free Neural Network repository (Lutz Prechelt)
* Integrated Planning and Learning
  + Prodigy (Carnegie Mellon University)
* Reinforcement Learning
  + Reinforcement Learning: A Tutorial (Mance E.
    Harmon; broken link)
  + Reinforcement Learning with Selective Perception and Hidden State (dissertation, Andrew McCallum)
  + A Reinforcement Learning Approach for Job-Shop Scheduling (Tom Dietterich and Wei Zhang)
* Rule Learning
  + RIPPER (William Cohen)
  + CBA: Classification and Association Rules (Bing Liu)
* Statistical Software
  + MODSTAT
  + Statistical software (Warren Kovach)
  + StatLib: Applied statistics algorithms (Carnegie Mellon University)
  + Multiple Regression with Automatic Curve Fitting (James Widdoes)
  + R (A clone of Splus; Gerhard Paass)
* Theory Revision (propositional)
  + NEITHER (University of Texas at Austin; propositional theory revision)
* Unsupervised Learning
  + Autoclass (NASA Ames)
    o Autoclass V3.1 C Program
  + CLUTO (Software Package for Clustering High-Dimensional Datasets; 12/01)
  + CViz (IBM's cluster visualization software, currently free as of 8/98)
  + ECOBWEB (Yoram Reich)
  + Fast Fuzzy Cluster (Michael Bickel)
  + MCLUST/EMCLUST (Chris Fraley)
  + Snob (Chris Wallace and David Dowe)
* Workbenches
  + Kepler (GMD)
  + MLC++ Toolbench
    o Stanford
    o Silicon Graphics
  + WEKA Toolbench (University of Waikato)

Special Interest Groups
* SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
* American Association for Artificial Intelligence (AAAI)
* Artificial Intelligence Research in Environmental Sciences (AIRIES)
* British Computer Society Specialist Group on Expert Systems
* Canadian Society for Computational Studies of Intelligence (CSCSI)
* International Grammatical Inference Community
* European Network of Excellence in ML (MLnet)
* IJCAI Inc.
* Neural Information Processing Systems (NIPS)
* Institute for Operations Research and the Management Sciences (INFORMS)
* Knowledge Discovery Mine
* Meteorology-AI (Eric Jones)
* Robots
  + AAAI 1996 Mobile Robot Competition and Exhibition
* ACL Special Interest Group on Natural Language Learning (SIGNLL)
* SOAR Project (University of Southern California)
* Society for AI and Statistics
* Classification Society of North America

Tutorials (and short notes)

Credits

Thanks for your feedback, and to NCARAI, NRL's AI Center, for providing computing support for this WWW page, maintained by David W. Aha. See also the CBR Resources Page.