Uploadyourdataset.Theenginehandlesfeatureselection,modelevaluation,andhyperparametertuningthendeliversthebestalgorithmautomatically.
best_model
Stacking Ens.
cv_folds
5-fold
accuracy
98.4%
0%
Multicollinearity Drop Threshold
0%
Missing Tolerance Limit
0th
Winsorization Percentile
0+
Max Target Encoding Threshold
We execute true statistical procedures on every dimension of your uploaded dataset.
Automatic Pearson correlation matrix tracking. Features exceeding r > 0.95 collinearity are aggressively dropped to preserve model stability.
Smart numeric imputation switching between Median (for heavy skew > 1) and Mean (Gaussian approximations) evaluated feature-by-feature.
Runs mathematical K-Means clustering prior to supervised modeling, adding non-linear cluster distances as raw engineered features.
Monitors target distribution density dynamically, applying Yeo-Johnson (skew > 1) or Log1p (skew > 1.5) transforms to linearize regressions.
Aggressive 1st/99th percentile clipping applied dynamically to heavily skewed numeric features where IQR outlier volume strictly exceeds 2%.
Switches effortlessly between standard OneHotEncoders (< 15 values) and signal-preserving TargetMeanEncoders for high cardinality data.
Follow exactly what happens to a dataset moving through SentientML.
| age | job | balance |
|---|---|---|
| 30 | unemployed | 1787 |
| 33 | services | 4789 |
| 35 | management | 1350 |
Connect your raw CSV, Excel, or JSON datasets directly in the browser. Employs intelligent memory-aware chunking to handle large files seamlessly and immediately profiles column types.
Problem Type
price
90%
3511
0
Automatically scans continuous and categorical columns to highlight valid targets. Evaluates unique value ratios to classify the problem type instantly as Regression or Classification.
Runs full structural evaluations including zero-variance filtering, missing-value density mapping, and generates a strict >0.95 Pearson collinearity dropping matrix.
Routine ML Pipeline
Categorical Encoding
3OneHot (2) + TargetEnc (1)
Constructs the mathematical pipeline. Replaces values via Adaptive mean/median imputation. Resolves severe skewness with Yeo-Johnson transforms and scales with Robust or Standard scalers.
Executes a two-phase tournament: screens all algorithms first, then deep-tunes the top 3 using Optuna Bayesian optimization with 5-fold cross-validation. Automatically applies SMOTE for extreme class imbalance.
Aggregates the tuned models into a Variance-Reduced Stacking Ensemble meta-learner. Evaluates performance mathematically and generates full SHAP and Permutation importance attributions.
Download the optimized pipeline structure to inference mode immediately. Enter new feature vectors into the compiled model to see live, on-the-fly predictions locally.

Founder & Architect
Mathematics overtakes guesswork.
"Traditional machine learning relies on profound trial and error where valuable weeks are lost to tuning and hoping."
SentientMLreplaces hope with certainty. The engine synthesizes exact, automated pipelines from your data's statistical DNA, generating the optimum mathematical architecture instantly.