Index operations support
We improved Index operations support (1944, 1955).
Here are some examples:
- Before
py
>>> kidx = ks.Index([1, 2, 3, 4, 5])
>>> kidx + kidx
Int64Index([2, 4, 6, 8, 10], dtype='int64')
>>> kidx + kidx + kidx
Traceback (most recent call last):
...
AssertionError: args should be single DataFrame or single/multiple Series
py
>>> ks.Index([1, 2, 3, 4, 5]) + ks.Index([6, 7, 8, 9, 10])
Traceback (most recent call last):
...
AssertionError: args should be single DataFrame or single/multiple Series
- After
python
>>> kidx = ks.Index([1, 2, 3, 4, 5])
>>> kidx + kidx + kidx
Int64Index([3, 6, 9, 12, 15], dtype='int64')
python
>>> ks.options.compute.ops_on_diff_frames = True
>>> ks.Index([1, 2, 3, 4, 5]) + ks.Index([6, 7, 8, 9, 10])
Int64Index([7, 9, 13, 11, 15], dtype='int64')
Other new features and improvements
We added the following new features:
DataFrame:
- `swaplevel` (1928)
- `swapaxes` (1946)
- `dot` (1945)
- `itertuples` (1960)
Series:
- `swaplevel` (1919)
- `swapaxes` (1954)
Index:
- `to_list` (1948)
MultiIndex:
- `to_list` (1948)
GroupBy:
- `tail` (1949)
- `median` (1957)
Other improvements and bug fixes
- Support DataFrame parameter in Series.dot (1931)
- Add a best practice for checkpointing. (1930)
- Remove implicit switch-ons of "compute.ops_on_diff_frames" (1953)
- Fix Series._to_internal_pandas and introduce Index._to_internal_pandas. (1952)
- Fix first/last_valid_index to support empty column DataFrame. (1923)
- Use pandas' transpose when the data is expected to be small. (1932)
- Fix tail to use the resolved copy (1942)
- Avoid unneeded reset_index in DataFrameGroupBy.describe. (1951)
- TypeError when Index.name / Series.name is not a hashable type (1883)
- Adjust data column names before attaching default index. (1947)
- Add plotly into the optional dependency in Koalas (1939)
- Add plotly backend test cases (1938)
- Don't pass stacked in plotly area chart (1934)
- Set upperbound of matplotlib to avoid failure on Ubuntu (1959)
- Fix GroupBy.descirbe for multi-index columns. (1922)
- Upgrade pandas version in CI (1961)
- Compare Series from the same anchor (1956)
- Add videos from Data+AI Summit 2020 EUROPE. (1963)
- Set PYARROW_IGNORE_TIMEZONE for binder. (1965)