Highlights
Extract.AI
Added extract.ai, which uses ChatGPT to extract data based on plain-language descriptions of the desired output.
yml
wrangles:
  - extract.ai:
      api_key: ${OPENAI_API_KEY}
      output:
        length:
          description: >-
            Any lengths found in the data
            such as cm, m, ft, etc.
        type:
          description: >-
            The type of item in the data
            such as spanner, cellphone, etc.
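
To make the idea of "plain descriptions" more concrete, the sketch below shows one way such output descriptions could be assembled into a JSON Schema style object of the kind often passed to chat models for structured extraction. This is an illustration only and is not taken from the Wrangles source: the helper `build_schema` is hypothetical, and treating every field as a string is an assumption.

```python
import json

# The same output descriptions as in the recipe above, written as a Python dict.
output = {
    "length": {"description": "Any lengths found in the data such as cm, m, ft, etc."},
    "type": {"description": "The type of item in the data such as spanner, cellphone, etc."},
}


def build_schema(output_spec):
    """Hypothetical helper: build a JSON Schema object from field descriptions."""
    return {
        "type": "object",
        "properties": {
            # Assumption: every extracted field is returned as a string.
            name: {"type": "string", "description": spec["description"]}
            for name, spec in output_spec.items()
        },
        "required": list(output_spec),
    }


schema = build_schema(output)
print(json.dumps(schema, indent=2))
```

Each key under `output` becomes a named property, and its description is the only guidance given for what to extract.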
Python
Added a python wrangle, which executes simple Python commands inline within a recipe. For more complex Python, use custom functions. The command is evaluated once per row, and row values are referenced by column name.
yml
wrangles:
  - python:
      # Quoted so YAML reads the command as a string, not a flow sequence
      command: "[x.upper() for x in my_column]"
      output: result
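
The per-row behaviour described above can be pictured with a small pure-Python sketch. It is not the Wrangles implementation: `apply_command` is a hypothetical helper, and rows are modelled as plain dictionaries rather than a dataframe. It evaluates one expression per row, with each column name bound as a variable.

```python
def apply_command(rows, command, output):
    """Hypothetical helper: evaluate `command` once per row.

    Column names are exposed to the expression as variables, and the
    result is stored under the `output` key of a copy of each row.
    """
    compiled = compile(command, "<command>", "eval")
    results = []
    for row in rows:
        # A copy of the row is used as the namespace so that column names
        # resolve as variables and the original row is not modified.
        value = eval(compiled, dict(row))
        results.append({**row, output: value})
    return results


rows = [
    {"my_column": ["a", "b"]},
    {"my_column": ["c"]},
]
result = apply_command(rows, "[x.upper() for x in my_column]", "result")
print(result)
```

Here `my_column` holds a list in each row, so the list comprehension upper-cases every element of that row's list. Note that `eval` runs arbitrary code, so a sketch like this should only be given trusted commands.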
Features
- Allow using custom functions for recipes called using a model ID.
- Allow the console command (wrangles.recipe) to use all recipe features, such as calling a recipe by model ID or URL.
- Added an optional timeout parameter for recipes to set a time limit in seconds. If omitted, the time is unlimited.
- Added a clear method to the memory connector to clear all saved data.
- Improved convert fractions to decimals to handle split fractions, where a whole number is joined to a fraction, such as 1-1/2.
- Added create.embeddings to generate embeddings for text.
- Allow parameterizing SQL queries and connecting to sandbox environments with the salesforce connector.
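
As an illustration of the split-fraction case mentioned in the list above, the following sketch converts values such as 1-1/2 to decimals using only the standard library. The pattern and the function name `fractions_to_decimals` are assumptions made for this example and do not reflect how Wrangles implements the conversion.

```python
import re
from fractions import Fraction

# Assumed pattern: an optional whole number joined to a fraction by a hyphen
# or a space (e.g. "1-1/2" or "1 1/2"), or a bare fraction such as "3/4".
_FRACTION = re.compile(r"(?<![\d/])(?:(\d+)[- ])?(\d+)/(\d+)(?![\d/])")


def fractions_to_decimals(text):
    """Replace fractions found in `text` with their decimal equivalents."""

    def _replace(match):
        whole, numerator, denominator = match.groups()
        if int(denominator) == 0:
            return match.group(0)  # leave undefined fractions untouched
        value = Fraction(int(whole or 0)) + Fraction(int(numerator), int(denominator))
        return str(float(value))

    return _FRACTION.sub(_replace, text)


print(fractions_to_decimals("1-1/2 inch bolt"))  # 1.5 inch bolt
print(fractions_to_decimals("3/4 cup"))  # 0.75 cup
```

Treating the hyphen as a separator rather than a minus sign is the key design choice here: "1-1/2" is read as one and a half, not as one minus a half.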
Bugs
- Prevent remove words from automatically capitalizing by default.
Misc
- Added an additional test job within the built container.
- Pinned the numpy version within the container to 1.24.3 due to an issue with the optimized build in newer versions.