Msdm

Latest version: v0.11

Safety actively analyzes 681881 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

0.5

This release mainly includes interfaces, algorithms, and test domains for tabular partially observable markov decision processes (POMDPs).

Summary of changes:
- Core POMDP classes:
- `PartiallyObservableMDP`
- `TabularPOMDP`
- `BeliefMDP`
- `POMDPPolicy`
- `ValueBasedTabularPOMDPPolicy`
- `AlphaVectorPolicy`
- `FiniteStateController`
- `StochasticFiniteStateController`
- Domains:
- `HeavenOrHell`
- `LoadUnload`
- `Tiger`
- Algorithms:
- `PointBasedValueIteration`
- `QMDP`
- `FSCGradientAscent`
- JuliaPOMDPs wrapper
- Fixes to Policy Iteration and Value Iteration
- Updated README.md

0.4

New Features
- QLearning, SARSA, Expected SARSA, DoubleQLearning
- Policy Iteration
- Entropy Regularized Policy Iteration
- Works with python 3.9
- QuickMDP and QuickTabularMDP constructors
- Construction of TabularMDPs from matrices
- New domains: CliffWalking, GridMDP generic class, Russell & Norvig gridworld example
- Gridworld plotting of action values

0.3

Major overhaul of core and tabular methods:

- States/actions are assumed to be hashable (e.g., Gridworld now uses frozendict; no built-in hashing functions; dictionaries are the main way to create maps)
- The distribution classes have been streamlined (Multinomial has been removed and DictDistribution is the main way to represent categorical distributions; .sample() takes a random number generator)
- Policy classes have been simplified
- More thorough type hints

0.2

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.