Changelog
Develop#
0.13.35#
- [FEATURE] Create ExpectationValidationGraph class to Maintain Relationship Between Expectation and Metrics and Use it to Associate Exceptions to Expectations (#3433)
- [BUGFIX] Addresses issue #2993 (#3054) by using configuration when it is available instead of discovering keys (listing keys) in existing sources. (#3377)
- [BUGFIX] Fix Data asset name rendering (#3431) (Thanks @shpolina)
- [DOCS] minor fix to syntax highlighting in how_to_contribute_a_new_expectation⊠(#3413) (Thanks @edjoesu)
- [DOCS] Fix broken links in how_to_create_a_new_expectation_suite_using_rule_based_profile⊠(#3410) (Thanks @edjoesu)
- [ENHANCEMENT] update list_expectation_suite_names and ExpectationSuiteValidationResult payload (#3419)
- [MAINTENANCE] Clean up Type Hints, JSON-Serialization, ID Generation and Logging in Objects in batch.py Module and its Usage (#3422)
- [MAINTENANCE] Fix Granularity of Exception Handling in ExecutionEngine.resolve_metrics() and Clean Up Type Hints (#3423)
- [MAINTENANCE] Fix broken links in how_to_create_a_new_expectation_suite_using_rule_based_profiler (#3441)
- [MAINTENANCE] Fix issue where BatchRequest object in configuration could cause Checkpoint to fail (#3438)
- [MAINTENANCE] Insure consistency between implementation of overriding Python hash() and internal ID property value (#3432)
- [MAINTENANCE] Performance improvement refactor for Spark unexpected values (#3368)
- [MAINTENANCE] Refactor MetricConfiguration out of validation_graph.py to Avoid Future Circular Dependencies in Python (#3425)
- [MAINTENANCE] Use ExceptionInfo to encapsulate common expectation validation result error information. (#3427)
0.13.34#
- [FEATURE] Configurable multi-threaded checkpoint speedup (#3362) (Thanks @jdimatteo)
- [BUGFIX] Insure that the "result_format" Expectation Argument is Processed Properly (#3364)
- [BUGFIX] fix error getting validation result from DataContext (#3359) (Thanks @zachzIAM)
- [BUGFIX] fixed typo and added CLA links (#3347)
- [DOCS] Azure Data Connector Documentation for Pandas and Spark. (#3378)
- [DOCS] Connecting to GCS using Spark (#3375)
- [DOCS] Docusaurus - Deploying Great Expectations in a hosted environment without file system or CLI (#3361)
- [DOCS] How to get a batch from configured datasource (#3382)
- [MAINTENANCE] Add Flyte to README (#3387) (Thanks @samhita-alla)
- [MAINTENANCE] Adds expect_table_columns_to_match_set (#3329) (Thanks @viniciusdsmello)
- [MAINTENANCE] Bugfix/skip substitute config variables in ge cloud mode (#3393)
- [MAINTENANCE] Clean Up ValidationGraph API Usage, Improve Exception Handling for Metrics, Clean Up Type Hints (#3399)
- [MAINTENANCE] Clean up ValidationGraph API and add Type Hints (#3392)
- [MAINTENANCE] Enhancement/update _set methods with kwargs (#3391) (Thanks @roblim)
- [MAINTENANCE] Fix incorrect ToC section name (#3395)
- [MAINTENANCE] Insure Correct Processing of the catch_exception Flag in Metrics Resolution (#3360)
- [MAINTENANCE] exempt batch_data from a deep_copy operation on RuntimeBatchRequest (#3388)
- [MAINTENANCE][WIP] Enhancement/cloud 169/update checkpoint.run for ge cloud (#3381)
0.13.33#
- [FEATURE] Add optional ge_cloud_mode flag to DataContext to enable use with Great Expectations Cloud.
- [FEATURE] Rendered Data Doc JSONs can be uploaded and retrieved from GE Cloud
- [FEATURE] Implement InferredAssetAzureDataConnector with Support for Pandas and Spark Execution Engines (#3372)
- [FEATURE] Spark connecting to Google Cloud Storage (#3365)
- [FEATURE] SparkDFExecutionEngine can load data accessed by ConfiguredAssetAzureDataConnector (integration tests are included). (#3345)
- [FEATURE][MER-293] GE Cloud Mode for DataContext (#3262) (Thanks @roblim)
- [BUGFIX] Allow for RuntimeDataConnector to accept custom query while suppressing temp table creation (#3335) (Thanks @NathanFarmer)
- [BUGFIX] Fix issue where multiple validators reused the same execution engine, causing a conflict in active batch (GE-3168) (#3222) (Thanks @jcampbell)
- [BUGFIX] Run batch_request dictionary through util function convert_to_json_serializable (#3349) (Thanks @NathanFarmer)
- [BUGFIX] added casting of numeric value to fix redshift issue #3293 (#3338) (Thanks @sariabod)
- [DOCS] Docusaurus - How to connect to an MSSQL database (#3353) (Thanks @NathanFarmer)
- [DOCS] GREAT-195 Docs remove all stubs and links to them (#3363)
- [MAINTENANCE] Update azure-pipelines-docs-integration.yml for Azure Pipelines
- [MAINTENANCE] Update implemented_expectations.md (#3351) (Thanks @spencerhardwick)
- [MAINTENANCE] Updating to reflect current Expectation dev state (#3348) (Thanks @spencerhardwick)
- [MAINTENANCE] docs: Clean up Docusaurus refs (#3371)
0.13.32#
- [FEATURE] Add Performance Benchmarks Using BigQuery. (Thanks @jdimatteo)
- [WIP][FEATURE] add backend args to run_diagnostics (#3257) (Thanks @edjoesu)
- [BUGFIX] Addresses Issue 2937. (#3236) (Thanks @BenGale93)
- [BUGFIX] SQL dialect doesn't register for BigQuery for V2 (#3324)
- [DOCS] "How to connect to data on GCS using Pandas" (#3311)
- [MAINTENANCE] Add CODEOWNERS with a single check for sidebars.js (#3332)
- [MAINTENANCE] Fix incorrect DataConnector usage of _get_full_file_path() API method. (#3336)
- [MAINTENANCE] Make Pandas against S3 and GCS integration tests more robust by asserting on number of batches returned and row counts (#3341)
- [MAINTENANCE] Make integration tests of Pandas against Azure more robust. (#3339)
- [MAINTENANCE] Prepare AzureUrl to handle WASBS format (for Spark) (#3340)
- [MAINTENANCE] Renaming default_batch_identifier in examples #3334
- [MAINTENANCE] Tests for RuntimeDataConnector at DataContext-level (#3304)
- [MAINTENANCE] Tests for RuntimeDataConnector at DataContext-level (Spark and Pandas) (#3325)
- [MAINTENANCE] Tests for RuntimeDataConnector at Datasource-level (Spark and Pandas) (#3318)
- [MAINTENANCE] Various doc patches (#3326)
- [MAINTENANCE] clean up imports and method signatures (#3337)
0.13.31#
- [FEATURE] Enable GCS DataConnectorintegration withPandasExecutionEngine(#3264)
- [FEATURE] Enable column_pair expectations and tests for Spark (#3294)
- [FEATURE] Implement InferredAssetGCSDataConnector(#3284)
- [FEATURE]/CHANGE run time format (#3272) (Thanks @serialbandicoot)
- [DOCS] Fix misc errors in "How to create renderers for Custom Expectations" (#3315)
- [DOCS] GDOC-217 remove stub links (#3314)
- [DOCS] Remove misc TODOs to tidy up docs (#3313)
- [DOCS] Standardize capitalization of various technologies in docs(#3312)
- [DOCS] Fix broken link to Contributor docs (#3295) (Thanks @discdiver)
- [MAINTENANCE] Additional tests for RuntimeDataConnector at Datasource-level (query) (#3288)
- [MAINTENANCE] Update GCSStoreBackend + tests (#2630) (Thanks @hmandsager)
- [MAINTENANCE] Write integration/E2E tests for ConfiguredAssetAzureDataConnector(#3204)
- [MAINTENANCE] Write integration/E2E tests for both GCSDataConnectors(#3301)
0.13.30#
- [FEATURE] Implement Spark Decorators and Helpers; Demonstrate on MulticolumnSumEqual Metric (#3289)
- [FEATURE] V3 implement expect_column_pair_values_to_be_in_set for SQL Alchemy execution engine (#3281)
- [FEATURE] Implement ConfiguredAssetGCSDataConnector(#3247)
- [BUGFIX] Fix import issues around cloud providers (GCS/Azure/S3) (#3292)
- [MAINTENANCE] Add force_reuse_spark_context to DatasourceConfigSchema (#3126) (thanks @gipaetusb and @mbakunze)
0.13.29#
- [FEATURE] Implementation of the Metric "select_column_values.unique.within_record" for SQLAlchemyExecutionEngine (#3279)
- [FEATURE] V3 implement ColumnPairValuesInSet for SQL Alchemy execution engine (#3278)
- [FEATURE] Edtf with support levels (#2594) (thanks @mielvds)
- [FEATURE] V3 implement expect_column_pair_values_to_be_equal for SqlAlchemyExecutionEngine (#3267)
- [FEATURE] add expectation for discrete column entropy (#3049) (thanks @edjoesu)
- [FEATURE] Add SQLAlchemy Provider for the the column_pair_values.a_greater_than_b Metric (#3268)
- [FEATURE] Expectations tests for BigQuery backend (#3219) (Thanks @jdimatteo)
- [FEATURE] Add schema validation for different GCS auth methods (#3258)
- [FEATURE] V3 - Implement column_pair helpers/providers for SqlAlchemyExecutionEngine (#3256)
- [FEATURE] V3 implement expect_column_pair_values_to_be_equal expectation for PandasExecutionEngine (#3252)
- [FEATURE] GCS DataConnector schema validation (#3253)
- [FEATURE] Implementation of the "expect_select_column_values_to_be_unique_within_record" Expectation (#3251)
- [FEATURE] Implement the SelectColumnValuesUniqueWithinRecord metric (for PandasExecutionEngine) (#3250)
- [FEATURE] V3 - Implement ColumnPairValuesEqual for PandasExecutionEngine (#3243)
- [FEATURE] Set foundation for GCS DataConnectors (#3220)
- [FEATURE] Implement "expect_column_pair_values_to_be_in_set" expectation (support for PandasExecutionEngine) (#3242)
- [BUGFIX] Fix deprecation warning for importing from collections (#3228) (thanks @ismaildawoodjee)
- [DOCS] Document BigQuery test dataset configuration (#3273) (Thanks @jdimatteo)
- [DOCS] Syntax and Link (#3266)
- [DOCS] API Links and Supporting Docs (#3265)
- [DOCS] redir and search (#3249)
- [MAINTENANCE] Update azure-pipelines-docs-integration.yml to include env vars for Azure docs integration tests
- [MAINTENANCE] Allow Wrong ignore_row_if Directive from V2 with Deprecation Warning (#3274)
- [MAINTENANCE] Refactor test structure for "Connecting to your data" cloud provider integration tests (#3277)
- [MAINTENANCE] Make test method names consistent for Metrics tests (#3254)
- [MAINTENANCE] Allow PandasExecutionEngineto acceptAzure DataConnectors(#3214)
- [MAINTENANCE] Standardize Arguments to MetricConfiguration Constructor; Use {} instead of dict(). (#3246)
0.13.28#
- [FEATURE] Implement ColumnPairValuesInSet metric for PandasExecutionEngine
- [BUGFIX] Wrap optional azure imports in data_connector setup
0.13.27#
- [FEATURE] Accept row_condition (with condition_parser) and ignore_row_if parameters for expect_multicolumn_sum_to_equal (#3193)
- [FEATURE] ConfiguredAssetDataConnector for Azure Blob Storage (#3141)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_domain_records() for SparkDFExecutionEngine (#3226)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_domain_records() for SqlAlchemyExecutionEngine (#3215)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_full_access_compute_domain() for PandasExecutionEngine (#3210)
- [FEATURE] Set foundation for Azure-related DataConnectors (#3188)
- [FEATURE] Update ExpectCompoundColumnsToBeUnique for V3 API (#3161)
- [BUGFIX] Fix incorrect schema validation for Azure data connectors (#3200)
- [BUGFIX] Fix incorrect usage of "all()" in the comparison of validation results when executing an Expectation (#3178)
- [BUGFIX] Fixes an error with expect_column_values_to_be_dateutil_parseable (#3190)
- [BUGFIX] Improve parsing of .ge_store_backend_id (#2952)
- [BUGFIX] Remove fixture parameterization for Cloud DBs (Snowflake and BigQuery) (#3182)
- [BUGFIX] Restore support for V2 API style custom expectation rendering (#3179) (Thanks @jdimatteo)
- [DOCS] Add condaas installation option in README (#3196) (Thanks @rpanai)
- [DOCS] Standardize capitalization of "Python" in "Connecting to your data" section of new docs (#3209)
- [DOCS] Standardize capitalization of Spark in docs (#3198)
- [DOCS] Update BigQuery docs to clarify the use of temp tables (#3184)
- [DOCS] Create _redirects (#3192)
- [ENHANCEMENT] RuntimeDataConnector messaging is made more clear for test_yaml_config()(#3206)
- [MAINTENANCE] Add credentialsYAML key support forDataConnectors(#3173)
- [MAINTENANCE] Fix minor typo in S3 DataConnectors (#3194)
- [MAINTENANCE] Fix typos in argument names and types (#3207)
- [MAINTENANCE] Update changelog. (#3189)
- [MAINTENANCE] Update documentation. (#3203)
- [MAINTENANCE] Update validate_your_data.md (#3185)
- [MAINTENANCE] update tests across execution engines and clean up coding patterns (#3223)
0.13.26#
- [FEATURE] Enable BigQuery tests for Azure CI/CD (#3155)
- [FEATURE] Implement MulticolumnMapExpectation class (#3134)
- [FEATURE] Implement the MulticolumnSumEqual Metric for PandasExecutionEngine (#3130)
- [FEATURE] Support row_condition and ignore_row_if Directives Combined for PandasExecutionEngine (#3150)
- [FEATURE] Update ExpectMulticolumnSumToEqual for V3 API (#3136)
- [FEATURE] add python3.9 to python versions (#3143) (Thanks @dswalter)
- [FEATURE]/MER-16/MER-75/ADD_ROUTE_FOR_VALIDATION_RESULT (#3090) (Thanks @rreinoldsc)
- [BUGFIX] Enable --v3-api suite editto proceed without selecting DataConnectors (#3165)
- [BUGFIX] Fix error when RuntimeBatchRequestis passed toSimpleCheckpointwithRuntimeDataConnector(#3152)
- [BUGFIX] allow reader_options in the CLI so can read .csv.gzfiles (#2695) (Thanks @luke321321)
- [DOCS] Apply Docusaurus tabs to relevant pages in new docs
- [DOCS] Capitalize python to Python in docs (#3176)
- [DOCS] Improve Core Concepts - Expectation Concepts (#2831)
- [MAINTENANCE] Error messages must be friendly. (#3171)
- [MAINTENANCE] Implement the "compound_columns_unique" metric for PandasExecutionEngine (with a unit test). (#3159)
- [MAINTENANCE] Improve Coding Practices in "great_expectations/expectations/expectation.py" (#3151)
- [MAINTENANCE] Update test_script_runner.py (#3177)
0.13.25#
- [FEATURE] Pass on meta-data from expectation json to validation result json (#2881) (Thanks @sushrut9898)
- [FEATURE] Add sqlalchemy engine support for column.most_common_valuemetric (#3020) (Thanks @shpolina)
- [BUGFIX] Added newline to CLI message for consistent formatting (#3127) (Thanks @ismaildawoodjee)
- [BUGFIX] fix pip install snowflake build error with Python 3.9 (#3119) (Thanks @jdimatteo)
- [BUGFIX] Populate (data) asset name in data docs for RuntimeDataConnector (#3105) (Thanks @ceshine)
- [DOCS] Correct path to docs_rtd/changelog.rst (#3120) (Thanks @jdimatteo)
- [DOCS] Fix broken links in "How to write a 'How to Guide'" (#3112)
- [DOCS] Port over "How to add comments to Expectations and display them in DataDocs" from RTD to Docusaurus (#3078)
- [DOCS] Port over "How to create a Batch of data from an in memory Spark or Pandas DF" from RTD to Docusaurus (#3099)
- [DOCS] Update CLI codeblocks in create_your_first_expectations.md (#3106) (Thanks @ories)
- [MAINTENANCE] correct typo in docstring (#3117)
- [MAINTENANCE] DOCS/GDOC-130/Add Changelog (#3121)
- [MAINTENANCE] fix docstring for expectation "expect_multicolumn_sum_to_equal" (previous version was not precise) (#3110)
- [MAINTENANCE] Fix typos in docstrings in map_metric_provider partials (#3111)
- [MAINTENANCE] Make sure that all imports use column_aggregate_metric_provider (not column_aggregate_metric). (#3128)
- [MAINTENANCE] Rename column_aggregate_metric.py into column_aggregate_metric_provider.py for better code readability. (#3123)
- [MAINTENANCE] rename ColumnMetricProvider to ColumnAggregateMetricProvider (with DeprecationWarning) (#3100)
- [MAINTENANCE] rename map_metric.py to map_metric_provider.py (with DeprecationWarning) for a better code readability/interpretability (#3103)
- [MAINTENANCE] rename table_metric.py to table_metric_provider.py with a deprecation notice (#3118)
- [MAINTENANCE] Update CODE_OF_CONDUCT.md (#3066)
- [MAINTENANCE] Upgrade to modern Python syntax (#3068) (Thanks @cclauss)
0.13.24#
- [FEATURE] Script to automate proper triggering of Docs Azure pipeline (#3003)
- [BUGFIX] Fix an undefined name that could lead to a NameError (#3063) (Thanks @cclauss)
- [BUGFIX] fix incorrect pandas top rows usage (#3091)
- [BUGFIX] Fix parens in Expectation metric validation method that always returned True assertation (#3086) (Thanks @morland96)
- [BUGFIX] Fix run_diagnostics for contrib expectations (#3096)
- [BUGFIX] Fix typos discovered by codespell (#3064) (Thanks cclauss)
- [BUGFIX] Wrap get_view_names in try clause for passing the NotImplemented error (#2976) (Thanks @kj-9)
- [DOCS] Ensuring consistent style of directories, files, and related references in docs (#3053)
- [DOCS] Fix broken link to example DAG (#3061) (Thanks fritz-astronomer)
- [DOCS] GDOC-198 cleanup TOC (#3088)
- [DOCS] Migrating pages under guides/miscellaneous (#3094) (Thanks @spbail)
- [DOCS] Port over âHow to configure a new Checkpoint using test_yaml_configâ from RTD to Docusaurus
- [DOCS] Port over âHow to configure an Expectation store in GCSâ from RTD to Docusaurus (#3071)
- [DOCS] Port over âHow to create renderers for custom Expectationsâ from RTD to Docusaurus
- [DOCS] Port over âHow to run a Checkpoint in Airflowâ from RTD to Docusaurus (#3074)
- [DOCS] Update how-to-create-and-edit-expectations-in-bulk.md (#3073)
- [MAINTENANCE] Adding a comment explaining the IDENTITY metric domain type. (#3057)
- [MAINTENANCE] Change domain key value from âcolumnâ to âcolumn_listâ in ExecutionEngine implementations (#3059)
- [MAINTENANCE] clean up metric errors (#3085)
- [MAINTENANCE] Correct the typo in the naming of the IDENTIFICATION semantic domain type name. (#3058)
- [MAINTENANCE] disable snowflake tests temporarily (#3093)
- [MAINTENANCE][DOCS] Port over âHow to host and share Data Docs on GCSâ from RTD to Docusaurus (#3070)
- [MAINTENANCE] Enable repr for MetricConfiguration to assist with troubleshooting. (#3075)
- [MAINTENANCE] Expand test of a column map metric to underscore functionality. (#3072)
- [MAINTENANCE] Expectation anonymizer supports v3 expectation registry (#3092)
- [MAINTENANCE] Fix -- check for column key existence in accessor_domain_kwargsn for condition map partials. (#3082)
- [MAINTENANCE] Missing import of SparkDFExecutionEngine was added. (#3062)
Older Changelist#
Older changelist can be found at https://github.com/great-expectations/great_expectations/blob/develop/docs_rtd/changelog.rst