A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas.

Overview

staircase logo

The leading use-case for the staircase package is for the creation and analysis of step functions.

Pretty exciting huh.

But don't hit the close button on the browser just yet. Let us convince you that much of the world around you can be modelled as step functions.

For example, the number of users viewing this page over time can be modelled as a step function. The value of the function increases by 1 every time a user arrives at the page, and decreases by 1 every time a user leaves the page. Let's say we have this data in vector format (i.e. tuple, list, numpy array, pandas series). Specifically, assume arrive and leave are vectors of times, expressed as minutes past midnight, for all page views occuring yesterday. Creating the corresponding step function is simple. To achieve it we use the Stairs class:

>>> import staircase as sc

>>> views = sc.Stairs()
>>> views.layer(arrive,leave)

We can visualise the function with the plot function:

>>> views.plot()

pageviews example

We can find the total time in minutes the page was viewed:

>>> views.clip(0,1440).integral()
9297.94622521079

We can find the average number of viewers:

>>> views.clip(0,1440).mean()
6.4569071008408265

We can find the average number of viewers, per hour of the day, and plot:

>>> views.slice(pd.interval_range(0, periods=24, freq=60)).mean().plot()

mean page views per hour

We can find the maximum concurrent views:

>>> views.clip(0,1440).max()
16

We can create histogram data showing relative frequency of concurrent viewers (and plot it):

>>> views.clip(0,1440).hist().plot.bar()

concurrent viewers histogram

Plotting is based on matplotlib and it requires relatively little effort to take the previous chart and improve the aesthetics:

concurrent viewers histogram (aesthetic)

There is plenty more analysis that could be done. The staircase package provides a rich variety of arithmetic operations, relational operations, logical operations, statistical operations, for use with Stairs, in addition to functions for univariate analysis, aggregations and compatibility with pandas.Timestamp.

Installation

staircase can be installed from PyPI:

python -m pip install staircase

or also with conda:

conda install -c conda-forge staircase

Documentation

The complete guide to using staircase can be found at staircase.dev

Contributing

There are many ways in which contributions can be made - the first and foremost being using staircase and giving feedback.

Bug reports, feature requests and ideas can be submitted via the Github issue tracker.

Additionally, bug fixes. enhancements, and improvements to the code and documentation are also appreciated and can be done via pull requests. Take a look at the current issues and if there is one you would like to work on please leave a comment to that effect.

See this beginner's guide to contributing, or Pandas' guide to contributing, to learn more about the process.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

License

This project is licensed under the MIT License - see the LICENSE file for details

Acknowledgments

The seeds of staircase began developing at the Hunter Valley Coal Chain Coordinator, where it finds strong application in analysing simulated data. Thanks for the support!

Comments
  • POC _closed value check

    POC _closed value check

    Not complete but seems to do what's desired for the add & subtract operations.

    • [x] closes #96
    • [x] tests added / passed
    • [x] ensure all linting tests pass
    • [x] changelog entry
    opened by amagee 13
  • BUG: error message appearing in debugging mode

    BUG: error message appearing in debugging mode

    • [X] I have checked that this issue has not already been reported.

    • [X] I have confirmed this bug exists on the latest version of staircase.


    Note: Please read this guide detailing how to provide the necessary information for us to reproduce your bug.

    Code Sample, a copy-pastable example

    # This in the code
    import staircase as sc
    import pandas as pd
    
    # this run in debug interactive mode
    pd.DataFrame()
    
    
    

    Problem description

    I'm still trying to make a minimum reproducible example, but for me it seems that somehow the only inclusion of the staircase dependence even without using it, is changing some pandas behaviour and raising an error. The error is non breaking and it appears when debugging code (interactively) that contains pandas dataframes even if staircase is not actually used in any operation and only imported at some point.

    The problem appears with delay and is not breaking, I'm not sure how to find the issue.

    Expected Output

    Nothing

    Dependency Versions``

    staircase 2.4.2 pandas 1.4.1 numpy 1.22.2

    Traceback (most recent call last): File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydevd_bundle\pydevd_comm.py", line 1590, in do_it msg = _pydev_completer.generate_completions_as_xml(frame, self.act_tok) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 172, in generate_completions_as_xml completions = completer.complete(act_tok) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 89, in complete return self.attr_matches(text) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_completer.py", line 151, in attr_matches words = dir2(obj, filter=filter) File "C:\Program Files\JetBrains\PyCharm Community Edition 2021.1.1\plugins\python-ce\helpers\pydev_pydev_bundle_pydev_imports_tipper.py", line 178, in generate_imports_tip_for_module dir_comps = dir(obj_to_complete) File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 40, in dir rv = (rv - self._dir_deletions()) | self._dir_additions() File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\generic.py", line 5632, in _dir_additions additions = super()._dir_additions() File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 29, in _dir_additions return {accessor for accessor in self._accessors if hasattr(self, accessor)} File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 29, in return {accessor for accessor in self._accessors if hasattr(self, accessor)} File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\pandas\core\accessor.py", line 182, in get accessor_obj = self._accessor(obj) File "C:\Users\PabloRuiz\AppData\Local\pypoetry\Cache\virtualenvs\transformer-_p2xsS-z-py3.8\lib\site-packages\staircase\core\arrays\accessor.py", line 14, in init raise TypeError( TypeError: sc accessor only valid for Series with Stairs dtype. Convert using .astype("Stairs").

    I think the problem is related to the word "sc" in the import and:

    @register_series_accessor("sc") class StairsAccessor: ...

    in accessor.

    opened by PabloRuizCuevas 11
  • Typo Update in 05_comparing.rst

    Typo Update in 05_comparing.rst

    Below is the changes made to the file in

    - To answer the question of "what if the step function is not boolean valued" we appeal to the boolean definition that Python applies to numbers;anything not zero is considered *true*, and consequently only zero is false.
    +To answer the question of "what if the step function is not boolean valued" we appeal to the boolean definition that Python applies to numbers: anything not zero is considered *true*, and consequently only zero is false.
    

    Checklist:

    • [x] I have read freeCodeCamp's contribution guidelines.
    • [x] My pull request has a descriptive title (not a vague title like Update index.md)
    • [x] My pull request targets the main branch of freeCodeCamp.
    • [x] I have tested these changes either locally on my machine, or GitPod.

    Closes #99

    The list of files can be found in below link link

    • [x] closes #99
    opened by Pavithradevadiga 4
  • ENH: raise error if operands of binary operations do not have same

    ENH: raise error if operands of binary operations do not have same "closed" value

    Describe the solution you'd like

    Stairs instances have a "closed" property which can be "left" or "right" indicates if they are left-closed, right-open or right-closed left-open.

    When performing binary operations with two Stairs objects, like addition, &, >= etc it is important that both operands have the same value of closed, and an error should be raised if not.

    It would be great to have this functionality implemented as a function decorator, which knows to compare self and other parameters which appear in these binary operator methods.

    enhancement advanced high priority 
    opened by venaturum 4
  • ENH: Add Stairs.from_values method

    ENH: Add Stairs.from_values method

    Is your feature request related to a problem?

    Allow a user who has timeseries data in pandas, which represents step function values, to create a Stairs instance from this data.

    Describe the solution you'd like

    Similar to https://github.com/staircase-dev/staircase/blob/00da1209eec21d3ea11f0f18f538637d2bae5ecf/staircase/core/stairs.py#L73

    Parameters should include

    • initial_value (float or nan)
    • values (pandas.Series)
    • closed (either "left" or "right")

    values should be indexed by change points of step function. The values of values should be step function values when approaching the change points from the right (this is the internal representation used in "value" column of Stairs._data).

    Additional context

    See https://www.staircase.dev/en/latest/development/guide/internals.html for more context

    enhancement beginner high priority 
    opened by venaturum 4
  • BUG: staircase.make_test_data fails with numpy < 1.17

    BUG: staircase.make_test_data fails with numpy < 1.17

    • [ ] I have checked that this issue has not already been reported.

    • [ ] I have confirmed this bug exists on the latest version of staircase.


    Note: Please read this guide detailing how to provide the necessary information for us to reproduce your bug.

    Code Sample, a copy-pastable example

    >>> import staircase as sc
    >>> sc.make_test_data()
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "C:\Users\rclement\Documents\temp11\.venv\lib\site-packages\staircase\test_data.py", line 56, in make_test_data
        return _make_test_data(dates=dates, positive_only=positive_only, seed=seed)
      File "C:\Users\rclement\Documents\temp11\.venv\lib\site-packages\staircase\test_data.py", line 71, in _make_test_data
        rng = np.random.default_rng()
    AttributeError: module 'numpy.random' has no attribute 'default_rng'
    
    

    Problem description

    Expected Output

    Dependency Versions``

    staircase 2.0.0 pandas 1.0.0 numpy 1.16.0

    bug intermediate high priority 
    opened by venaturum 4
  • DOC: Typo in

    DOC: Typo in "How can I compare my step function"

    Location of the documentation

    https://www.staircase.dev/en/latest/getting_started/intro_tutorials/05_comparing.html#logical

    Note: You can check the latest versions of the docs on master here.

    Documentation problem

    Typo below:

    To answer the question of “what if the step function is not boolean valued” we appeal to the boolean definition that Python applies to numbers;anything not zero is considered true, and consequently only zero is false.

    documentation beginner medium priority 
    opened by venaturum 3
  • ENH: Allow Stairs.fillna to take a Stairs parameter

    ENH: Allow Stairs.fillna to take a Stairs parameter

    Is your feature request related to a problem?

    We want the value parameter in Stairs.fillna to be able to take a staircase.Stairs parameter

    current definition https://github.com/staircase-dev/staircase/blob/cb0265d1b182ad3a8452f544c197ce54aef6feea/staircase/core/ops/masking.py#L160

    docstring https://github.com/staircase-dev/staircase/blob/cb0265d1b182ad3a8452f544c197ce54aef6feea/staircase/core/ops/docstrings.py#L459

    Describe the solution you'd like

    See https://www.staircase.dev/en/latest/user_guide/cookbook.html#fill-undefined-intervals-of-one-step-function-with-another for an implementation

    enhancement beginner medium priority 
    opened by venaturum 3
  • Bump mistune from 0.8.4 to 2.0.3

    Bump mistune from 0.8.4 to 2.0.3

    Bumps mistune from 0.8.4 to 2.0.3.

    Release notes

    Sourced from mistune's releases.

    Version 2.0.2

    Fix escape_url via lepture/mistune#295

    Version 2.0.1

    Fix XSS for image link syntax.

    Version 2.0.0

    First release of Mistune v2.

    Version 2.0.0 RC1

    In this release, we have a Security Fix for harmful links.

    Version 2.0.0 Alpha 1

    This is the first release of v2. An alpha version for users to have a preview of the new mistune.

    Changelog

    Sourced from mistune's changelog.

    Changelog

    Here is the full history of mistune v2.

    Version 2.0.4

    
    Released on Jul 15, 2022
    
    • Fix url plugin in &lt;a&gt; tag
    • Fix * formatting

    Version 2.0.3

    Released on Jun 27, 2022

    • Fix table plugin
    • Security fix for CVE-2022-34749

    Version 2.0.2

    
    Released on Jan 14, 2022
    

    Fix escape_url

    Version 2.0.1

    Released on Dec 30, 2021

    XSS fix for image link syntax.

    Version 2.0.0

    
    Released on Dec 5, 2021
    

    This is the first non-alpha release of mistune v2.

    Version 2.0.0rc1

    Released on Feb 16, 2021

    Version 2.0.0a6

    
    </tr></table> 
    

    ... (truncated)

    Commits
    • 3f422f1 Version bump 2.0.3
    • a6d4321 Fix asteris emphasis regex CVE-2022-34749
    • 5638e46 Merge pull request #307 from jieter/patch-1
    • 0eba471 Fix typo in guide.rst
    • 61e9337 Fix table plugin
    • 76dec68 Add documentation for renderer heading when TOC enabled
    • 799cd11 Version bump 2.0.2
    • babb0cf Merge pull request #295 from dairiki/bug.escape_url
    • fc2cd53 Make mistune.util.escape_url less aggressive
    • 3e8d352 Version bump 2.0.1
    • Additional commits viewable in compare view

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 2
  • Test logical xor stairs 1

    Test logical xor stairs 1

    Seems to do what was asked for in the ticket, let me know if you want anything more / different.

    • [x] closes #105
    • [x] tests added / passed
    • [x] ensure all linting tests pass
    • [x] changelog entry
    opened by amagee 2
  • Updated black version, and flake 8

    Updated black version, and flake 8

    • [X] closes #165
    • [ ] tests added / passed
    • [x] ensure all linting tests pass
    • [ ] changelog entry

    No relevant changes, I thought we would have some little style change cause the new version of black, but nothing changed, so the PR is not the greatest contribution.

    Anyway, I set in the pypoetry.toml the line-lenght of black to be 88, which was already the the default, so no also not changes there, it is black and pandas default too, so it makes sense for this project. thought I usually like more 120 or so.

    opened by PabloRuizCuevas 1
  • Bump setuptools from 65.5.0 to 65.5.1

    Bump setuptools from 65.5.0 to 65.5.1

    Bumps setuptools from 65.5.0 to 65.5.1.

    Changelog

    Sourced from setuptools's changelog.

    v65.5.1

    Misc ^^^^

    • #3638: Drop a test dependency on the mock package, always use :external+python:py:mod:unittest.mock -- by :user:hroncok
    • #3659: Fixed REDoS vector in package_index.
    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 1
  • Bump certifi from 2022.9.24 to 2022.12.7

    Bump certifi from 2022.9.24 to 2022.12.7

    Bumps certifi from 2022.9.24 to 2022.12.7.

    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 0
  • Bump pillow from 9.2.0 to 9.3.0

    Bump pillow from 9.2.0 to 9.3.0

    Bumps pillow from 9.2.0 to 9.3.0.

    Release notes

    Sourced from pillow's releases.

    9.3.0

    https://pillow.readthedocs.io/en/stable/releasenotes/9.3.0.html

    Changes

    ... (truncated)

    Changelog

    Sourced from pillow's changelog.

    9.3.0 (2022-10-29)

    • Limit SAMPLESPERPIXEL to avoid runtime DOS #6700 [wiredfool]

    • Initialize libtiff buffer when saving #6699 [radarhere]

    • Inline fname2char to fix memory leak #6329 [nulano]

    • Fix memory leaks related to text features #6330 [nulano]

    • Use double quotes for version check on old CPython on Windows #6695 [hugovk]

    • Remove backup implementation of Round for Windows platforms #6693 [cgohlke]

    • Fixed set_variation_by_name offset #6445 [radarhere]

    • Fix malloc in _imagingft.c:font_setvaraxes #6690 [cgohlke]

    • Release Python GIL when converting images using matrix operations #6418 [hmaarrfk]

    • Added ExifTags enums #6630 [radarhere]

    • Do not modify previous frame when calculating delta in PNG #6683 [radarhere]

    • Added support for reading BMP images with RLE4 compression #6674 [npjg, radarhere]

    • Decode JPEG compressed BLP1 data in original mode #6678 [radarhere]

    • Added GPS TIFF tag info #6661 [radarhere]

    • Added conversion between RGB/RGBA/RGBX and LAB #6647 [radarhere]

    • Do not attempt normalization if mode is already normal #6644 [radarhere]

    ... (truncated)

    Commits

    Dependabot compatibility score

    Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


    Dependabot commands and options

    You can trigger Dependabot actions by commenting on this PR:

    • @dependabot rebase will rebase this PR
    • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
    • @dependabot merge will merge this PR after your CI passes on it
    • @dependabot squash and merge will squash and merge this PR after your CI passes on it
    • @dependabot cancel merge will cancel a previously requested merge and block automerging
    • @dependabot reopen will reopen this PR if it is closed
    • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
    • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
    • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
    • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
    • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
    • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

    You can disable automated security fix PRs for this repo from the Security Alerts page.

    dependencies 
    opened by dependabot[bot] 0
  • DOC: upgrade pydata-sphinx-theme

    DOC: upgrade pydata-sphinx-theme

    Current docs based on 0.8.1 and latest is 0.11.0

    Upgrading will result in several breaking changes which will result in build crashes, and unwanted theming.

    CSS and logos may need to be redone.

    opened by venaturum 0
  • DOC: demonstrate

    DOC: demonstrate "string timestamp" functionality in user guide and case studies

    In v2.4.0 the ability to pass string representations of timestamps into Stairs.clip, Stairs.mask, Stairs.where was introduced. This is so users can do this

    my_stepfunction.clip("2020", "2021")

    instead of my_stepfunction.clip(pd.Timestamp("2020"), pd.Timestamp("2021"))

    There are opportunities to utilise this new feature in examples found in the user guide and case studies. Perhaps the best approach is, for each page, use the longer version first, then shorter version in subsequent code, noting that the feature is being used.

    documentation beginner 
    opened by venaturum 0
Releases(v2.5.0)
  • v2.5.0(Oct 19, 2022)

    This is a minor release in the 2.x series and includes a couple of minor bugfixes. There is no added functionality however support for Python 3.6 has been dropped.

    See the changelogfor details.

    Source code(tar.gz)
    Source code(zip)
  • v2.4.2(May 13, 2022)

  • v2.4.1(May 9, 2022)

    This is a patch release in the 2.4.x series and includes a fix for a bug introduced in v2.4.0 where a pandas warning was generated when constructing/layering Stairs objects with NA-like arguments.

    Source code(tar.gz)
    Source code(zip)
  • v2.4.0(May 2, 2022)

  • v2.3.0(Dec 21, 2021)

  • v2.2.0(Nov 27, 2021)

    This is a minor release in the 2.x series and includes new functionality for pandas extensions including Extension Array, Series accessor, and several new methods.

    See the changelog for details.

    Source code(tar.gz)
    Source code(zip)
  • v2.1.1(Nov 2, 2021)

    This is a patch release in the 2.1.x series and includes a single bug fix. The bug is inherited from a known issue in pandas (https://github.com/pandas-dev/pandas/issues/34251) which causes staircase.Stairs.plot to fail when style="hlines" and numexpr package is installed.

    Source code(tar.gz)
    Source code(zip)
  • v2.1.0(Oct 29, 2021)

  • v2.0.4(Oct 12, 2021)

    This is a patch release in the 2.0.x series and includes a single bug fix:

    • bugfix for array aggregation where array contains a constant value step function over non-float domain (#GH119)
    Source code(tar.gz)
    Source code(zip)
  • v2.0.3(Oct 7, 2021)

  • v2.0.2(Oct 3, 2021)

  • v2.0.1(Sep 13, 2021)

  • v2.0.0(Aug 25, 2021)

    A major release with several enhancements added, namely masking, slicing and the implicit handling of dates. The library is now based on pandas and numpy, resulting in significant speedups. While much of the API has remained the same, or similar, there are many backwards incompatible changes and deprecations. See documentation at www.staircase.dev for details.

    Source code(tar.gz)
    Source code(zip)
Making the DAEN information accessible.

The purpose of this repository is to make the information on Australian COVID-19 adverse events accessible. The Therapeutics Goods Administration (TGA) keeps a database of adverse reactions to medica

10 May 10, 2022
Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python 📊

Thomas 2 May 26, 2022
Utilize data analytics skills to solve real-world business problems using Humana’s big data

Humana-Mays-2021-HealthCare-Analytics-Case-Competition- The goal of the project is to utilize data analytics skills to solve real-world business probl

Yongxian (Caroline) Lun 1 Dec 27, 2021
Universal data analysis tools for atmospheric sciences

U_analysis Universal data analysis tools for atmospheric sciences Script written in python 3. This file defines multiple functions that can be used fo

Luis Ackermann 1 Oct 10, 2021
pyETT: Python library for Eleven VR Table Tennis data

pyETT: Python library for Eleven VR Table Tennis data Documentation Documentation for pyETT is located at https://pyett.readthedocs.io/. Installation

Tharsis Souza 5 Nov 19, 2022
A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.

Realtime Financial Market Data Visualization and Analysis Introduction This repo shows my project about real-time stock data pipeline. All the code is

6 Sep 07, 2022
This creates a ohlc timeseries from downloaded CSV files from NSE India website and makes a SQLite database for your research.

NSE-timeseries-form-CSV-file-creator-and-SQL-appender- This creates a ohlc timeseries from downloaded CSV files from National Stock Exchange India (NS

PILLAI, Amal 1 Oct 02, 2022
A Python module for clustering creators of social media content into networks

sm_content_clustering A Python module for clustering creators of social media content into networks. Currently supports identifying potential networks

72 Dec 30, 2022
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.

Disclaimer This project is stable and being incubated for long-term support. It may contain new experimental code, for which APIs are subject to chang

Uber Open Source 1.6k Dec 29, 2022
Data pipelines built with polars

valves Warning: the project is very much work in progress. Valves is a collection of functions for your data .pipe()-lines. This project aimes to host

14 Jan 03, 2023
Monitor the stability of a pandas or spark dataframe ⚙︎

Population Shift Monitoring popmon is a package that allows one to check the stability of a dataset. popmon works with both pandas and spark datasets.

ING Bank 403 Dec 07, 2022
Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

2 Nov 20, 2021
MIR Cheatsheet - Survival Guidebook for MIR Researchers in the Lab

MIR Cheatsheet - Survival Guidebook for MIR Researchers in the Lab

SeungHeonDoh 3 Jul 02, 2022
DefAP is a program developed to facilitate the exploration of a material's defect chemistry

DefAP is a program developed to facilitate the exploration of a material's defect chemistry. A large number of features are provided and rapid exploration is supported through the use of autoplotting

6 Oct 25, 2022
PyPSA: Python for Power System Analysis

1 Python for Power System Analysis Contents 1 Python for Power System Analysis 1.1 About 1.2 Documentation 1.3 Functionality 1.4 Example scripts as Ju

758 Dec 30, 2022
The micro-framework to create dataframes from functions.

The micro-framework to create dataframes from functions.

Stitch Fix Technology 762 Jan 07, 2023
nrgpy is the Python package for processing NRG Data Files

nrgpy nrgpy is the Python package for processing NRG Data Files Website and source: https://github.com/nrgpy/nrgpy Documentation: https://nrgpy.github

NRG Tech Services 23 Dec 08, 2022
[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022

Nested Collaborative Learning for Long-Tailed Visual Recognition This repository is the official PyTorch implementation of the paper in CVPR 2022: Nes

Jun Li 65 Dec 09, 2022
This is a repo documenting the best practices in PySpark.

Spark-Syntax This is a public repo documenting all of the "best practices" of writing PySpark code from what I have learnt from working with PySpark f

Eric Xiao 447 Dec 25, 2022
Python ELT Studio, an application for building ELT (and ETL) data flows.

The Python Extract, Load, Transform Studio is an application for performing ELT (and ETL) tasks. Under the hood the application consists of a two parts.

Schlerp 55 Nov 18, 2022