Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
v0.16.0 Nov. 24, 2020
Enhancements
make_pipeline
to acceptWoodwork
inputs Update pipelines andmake_pipelines
to accept Woodwork DataTables #1393Woodwork
inputs Update components to accept Woodwork inputs #1423AutoMLSearch
Freeze Hyperparameters for AutoMLSearch #1284Target Encoder
into transformer components Add Target Encoder to Components #1401AutoMLSearch
Add callback for error handling in AutoMLSearch #1403explain_predictions_best_worst
output to help users identify which rows in their data are included Adding the Index ID to explain_prediction_best_worst_output #1365explain_predictions_*
functions are now determined by the magnitude of shap values as opposed to thetop_k
largest and smallest shap values. Displaying the top_k features with largest shap value magnitudes #1374is_defined_for_problem_type
method toObjectiveBase
Add TimeSeriesRegression problem type #1386random_state
parameter tomake_pipeline_from_components
function make_pipeline_from_components accepts a random state #1411DelayedFeaturesTransformer
Add a DelayedFeaturesTransformer #1396TimeSeriesRegressionPipeline
class Timeseries regression pipeline #1418core-requirements.txt
from the package distribution Removing core-requirements from installed package #1429"code"
and"details"
fields Update data check message to include a "code" and "details" fields #1451 Fix high variance cv message for automl #1462TimeSeriesSplit
data splitter for time series problems Data split for time series #1441problem_configuration
parameter to AutoMLSearch Add problem_configuration parameter to AutoMLSearch #1457Fixes
IndexError
raised inAutoMLSearch
whenensembling = True
but only one pipeline to iterate over Fixed IndexError when ensembling = True but number of pipelines to search over is 1 #1397AutoMLSearch
Fix stacked ensemble and LightGBM errors in AutoMLSearch #1388Woodwork
'sto_pandas()
toto_series()
andto_dataframe()
Update Woodworkto_pandas
toto_dataframe
andto_series
#1428ExpVariance
objective on data with highly skewed target When baseline score is close to 0, set percent improved to nan #1467Changes
OutliersDataCheck
to return the list of columns, rather than rows, that contain outliers Change OutliersDataCheck to find outliers for columns #1377AutoMLSearch
to passWoodwork
data structures to every pipeline (instead of pandas DataFrames) Update AutoML to pass Woodwork DataTables to every pipeline/component (instead of pandas DataFrames) #1450AutoMLSearch
to default tomax_batches=1
instead ofmax_iterations=5
Update automl to use default of max_batches=1 #1452Documentation Changes
DataChecks
, and usage of data checks in AutoML Update documentation to include all data checks and usage of data checks in AutoML #1412np.array
tonp.ndarray
Update docstrings fromnp.array
tonp.ndarray
#1417Testing Changes
category_encoders
from test-requirements.txt Removing category_encoders from test-requirements.txt #1373make lint
to check notebook versions in the docs Adding a script to standardize the notebook versions. #1431make lint-fix
to standardize notebook versions in the docs Adding a script to standardize the notebook versions. #1431Breaking Changes
top_k
andtop_k_features
parameters inexplain_predictions_*
functions now returnk
features as opposed to2 * k
features Displaying the top_k features with largest shap value magnitudes #1374problem_type
toproblem_types
inRegressionObjective
,BinaryClassificationObjective
, andMulticlassClassificationObjective
Rename LabelLeakageDataCheck to TargetLeakageDataCheck. #1319