Data Mining Extensions

Data Mining Extensions (DMX) is a query language for Data Mining Models supported by Microsoft's SQL Server Analysis Services product.

Like SQL, it supports a data definition language, data manipulation language and a data query language, all three with SQL-like syntax. Whereas SQL statements operate on relational tables, DMX statements operate on data mining models. Similarly, SQL Server supports the MDX language for OLAP databases. DMX is used to create and train data mining models, and to browse, manage, and predict against them. DMX is composed of data definition language (DDL) statements, data manipulation language (DML) statements, and functions and operators.

DMX Queries

DMX Queries are formulated using the SELECT statement. They can extract information from existing data mining models in various ways.

Data Definition Language

The Data Definition Language (DDL) part of DMX can be used to

Data Manipulation Language

The Data Manipulation Language (DML) part of DMX can be used to

Example: a prediction query

This example is a singleton prediction query, which predicts for the given customer whether she will be interested in home loan products.

SELECT
  [Loan Seeker],
  PredictProbability([Loan Seeker])
FROM
  [Decision Tree]
NATURAL PREDICTION JOIN
(SELECT 
   35 AS [Age],
   'Y' AS [House Owner],
   'M' AS [Marital Status],
   'F' AS [Gender],
   2 AS [Number Cars Owned],
   2 AS [Total Children],
   18 AS [Total Years of Education]
)

See also

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.