Data Mining Extensions

From Wikipedia, the free encyclopedia

Data Mining Extensions (DMX) is a query language for data mining models, much as SQL is a query language for relational databases and MDX is a query language for OLAP databases. DMX syntax resembles SQL more closely than it resembles MDX.

Data Definition Language

The Data Definition Language (DDL) part of DMX can be used to:

  • Create new data mining models and mining structures - CREATE MINING STRUCTURE, CREATE MINING MODEL
  • Delete existing data mining models and mining structures - DROP MINING STRUCTURE, DROP MINING MODEL
  • Export and import mining structures - EXPORT, IMPORT
  • Create a new mining model based on an existing one - SELECT INTO
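
As a rough illustration of the DDL statements listed above, the following sketch defines and then removes a model whose columns match the prediction example later in this article. The key column [Customer Key], the data types, the content types and the choice of the Microsoft_Decision_Trees algorithm are assumptions made for illustration; they are not taken from an actual model definition.

```sql
-- Sketch only: column types and the key column are assumed.
-- CREATE MINING MODEL also creates the underlying mining structure.
CREATE MINING MODEL [Decision Tree]
(
  [Customer Key]             LONG KEY,           -- hypothetical case key
  [Age]                      LONG CONTINUOUS,
  [House Owner]              TEXT DISCRETE,
  [Marital Status]           TEXT DISCRETE,
  [Gender]                   TEXT DISCRETE,
  [Number Cars Owned]        LONG DISCRETE,
  [Total Children]           LONG DISCRETE,
  [Total Years of Education] LONG CONTINUOUS,
  [Loan Seeker]              TEXT DISCRETE PREDICT  -- column to be predicted
)
USING Microsoft_Decision_Trees

-- Run as a separate statement when the model is no longer needed.
DROP MINING MODEL [Decision Tree]
```

The PREDICT flag marks [Loan Seeker] as the predictable column; the remaining non-key columns serve as inputs.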

Data Manipulation Language

The Data Manipulation Language (DML) part of DMX can be used to:

  • Train mining models - INSERT INTO
  • Browse data in mining models - SELECT FROM
  • Make predictions using a mining model - SELECT ... FROM PREDICTION JOIN
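
The first two DML operations above might look like the following sketch. The data source name [Customer Data Source], the table dbo.Customers and its column names are hypothetical placeholders, and the model is assumed to have a [Customer Key] key column plus the columns used in the prediction example below.

```sql
-- Train the model: map model columns to the columns returned by a
-- source query. Data source, table and source column names are hypothetical.
INSERT INTO MINING MODEL [Decision Tree]
(
  [Customer Key], [Age], [House Owner], [Marital Status], [Gender],
  [Number Cars Owned], [Total Children], [Total Years of Education],
  [Loan Seeker]
)
OPENQUERY([Customer Data Source],
  'SELECT CustomerKey, Age, HouseOwner, MaritalStatus, Gender,
          NumberCarsOwned, TotalChildren, TotalYearsOfEducation, LoanSeeker
   FROM dbo.Customers')

-- Browse the trained model (run as a separate statement):
-- returns the model content, one row per node.
SELECT * FROM [Decision Tree].CONTENT
```

Columns are bound by position, so the order of the model column list must match the order of the columns in the source query.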

Example of a prediction query

This example is a singleton prediction query: the input case is supplied inline rather than read from a data source. It predicts whether the given customer will be interested in home loan products.

SELECT
  [Loan Seeker],
  PredictProbability([Loan Seeker])
FROM
  [Decision Tree]
NATURAL PREDICTION JOIN
(SELECT 
   35 AS [Age],
   'Y' AS [House Owner],
   'M' AS [Marital Status],
   'F' AS [Gender],
   2 AS [Number Cars Owned],
   2 AS [Total Children],
   18 AS [Total Years of Education]
)
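
NATURAL PREDICTION JOIN matches input columns to model columns by name, which is why the query above needs no ON clause. For comparison, a batch version of the same prediction could be sketched as follows, with an explicit ON clause mapping each source column to a model column. The data source [Customer Data Source], the table dbo.Prospects and its column names are hypothetical.

```sql
-- Sketch of a batch prediction: one prediction per row of the source query.
-- Data source, table and source column names are hypothetical.
SELECT
  t.[CustomerKey],
  [Decision Tree].[Loan Seeker],
  PredictProbability([Decision Tree].[Loan Seeker])
FROM
  [Decision Tree]
PREDICTION JOIN
  OPENQUERY([Customer Data Source],
    'SELECT CustomerKey, Age, HouseOwner, MaritalStatus, Gender,
            NumberCarsOwned, TotalChildren, TotalYearsOfEducation
     FROM dbo.Prospects') AS t
ON
  [Decision Tree].[Age]                      = t.[Age] AND
  [Decision Tree].[House Owner]              = t.[HouseOwner] AND
  [Decision Tree].[Marital Status]           = t.[MaritalStatus] AND
  [Decision Tree].[Gender]                   = t.[Gender] AND
  [Decision Tree].[Number Cars Owned]        = t.[NumberCarsOwned] AND
  [Decision Tree].[Total Children]           = t.[TotalChildren] AND
  [Decision Tree].[Total Years of Education] = t.[TotalYearsOfEducation]
```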
