SAS® Enterprise Miner™ Features
Intuitive interfaces
- Interactive GUI for building process flow diagrams.
- Batch processing code for scheduling large modeling and scoring jobs.
Data preparation, summarization & exploration
- Access and integrate structured and unstructured data sources.
- Outlier filtering.
- Data sampling.
- Data partitioning.
- File import.
- Merge and append tools.
- Univariate statistics and plots.
- Bivariate statistics and plots.
- Batch and interactive plots.
- Segment profile plots.
- Easy-to-use Graphics Explorer wizard and Graph Explore node.
- Interactively linked plots and tables.
- Data transformations.
- Time series data preparation and analysis.
- Interactive variable binning.
- Rules Builder node for creating ad hoc data-driven rules and policies.
- Data replacement.
Advanced predictive & descriptive modeling
- Clustering and self-organizing maps.
- Market basket analysis.
- Sequence and web path analysis.
- Link analysis.
- Dimension reduction techniques:
- Variable selection.
- LARS (Least Angle Regression) variable selection.
- Principal components.
- Variable clustering.
- Time series mining.
- Linear and logistic regression.
- Decision trees.
- Gradient boosting.
- Neural networks.
- Partial least squares regression.
- Two-stage modeling.
- Memory-based reasoning.
- Model ensembles, including bagging and boosting.
- Time series data mining.
- Survival analysis.
- Ratemaking for insurance.
- Incremental response/net lift models.
Open-source R integration node
- Write code in the R language inside of SAS Enterprise Miner.
- Makes SAS Enterprise Miner data and metadata available to your R code and returns R results to SAS Enterprise Miner.
- Training and scoring for supervised and unsupervised R models.
- Allows for data transformation and data explorations of R models in SAS Enterprise Miner.
- Generates model comparisons and SAS score code for supported models.
Select set of high-performance procedures & nodes
- Multithreaded, high-performance procedures:
- High-performance variable reduction.
- High-performance neural networks.
- High-performance random forests.
- High-performance 4score.
- High-performance decide.
- High-performance data mining database.
- High-performance sampling.
- High-performance data summarization.
- High-performance imputation.
- High-performance binning.
- High-performance correlation.
- High-performance Bayesian network.
- High-performance clustering.
- High-performance Support Vector Machine.
- Multithreaded, high-performance nodes:
- HP Data Partition.
- HP Explore.
- HP Transform.
- HP Variable Selection.
- HP Regression.
- HP Neural.
- HP Forest.
- HP Impute.
- HP Tree.
- HP GLM.
- HP Principal Components.
- HP Cluster.
- HP SVM.
Fast, easy & self-sufficient way for business users to generate models
- SAS Rapid Predictive Modeler automatically generates predictive models for a variety of business problems.
- Business analysts and subject-matter experts work from SAS Enterprise Guide or the SAS Add-In for Microsoft Office (Excel only).
- Models can be opened, augmented and modified in SAS Enterprise Miner.
- Produces concise reports, including variable importance charts, lift charts, ROC charts and model scorecards, for easy consumption and review.
- Ability to score the training data with an option to save the scored data set.
Model comparisons, reporting & management
- Assessment features for comparing multiple models using lift curves, statistical diagnostics and ROI metrics.
- Highly visual model comparison interface.
- Innovative Cutoff node examines to determine probability cutoff point(s) for binary targets.
- Report creation and distribution.
- Model result packages.
- Group processing for multiple targets and segments.
- Interactive environment for comparing and contrasting competing models and assessing the importance of key input variables on the predicted response times.
- Register Model node provides integrated environment for model registration into the SAS Metadata Server.
- Macro can also be used for registering models developed with SAS code into the SAS Metadata Server.
Automated scoring process
- Interactive scoring in a variety of real-time or batch environments.
- Automatically generates score code in SAS, C, Java and PMML.
- Score data based on models saved as PMML documents (experimental).
- Score SAS Enterprise Miner models directly inside Aster, EMC Pivotal (previously Greenplum), IBM DB2, IBM Netezza, Oracle and Teradata databases with SAS Scoring Accelerator.
- Model registration and management.
- Deploy models in multiple environments.
- Integrate SAS Enterprise Miner training and scoring processes directly into other SAS solutions.
Open, extensible design
- Extension node for easily adding tools and personalized SAS code.
- Interactive editor features for training and score code.
- Integrate text mining for analysis of both structured and unstructured data.
- Incorporate time series, Web paths and associations rules as additional input variables into the model development process.
Scalable processing
- The Java client and the SAS server architecture both scale from single-user to large enterprise solutions.
- Server-based processing and storage.
- Grid computing, in-database and in-memory processing options.
- Asynchronous model building.
- Ability to stop processing cleanly.
- Parallel processing.
- Multithreaded predictive algorithms.