Tidy interface to polars

Last update: Jan 08, 2023

Related tags

Overview

tidypolars

tidypolars is a data frame library built on top of the blazingly fast polars library that gives access to methods and functions familiar to R tidyverse users.

Installation

$ pip3 install tidypolars

General syntax

tidypolars methods are designed to work like tidyverse functions:

import tidypolars as tp
from tidypolars import col, desc

df = tp.Tibble(x = range(3), y = range(3, 6), z = ['a', 'a', 'b'])

(
    df
    .select('x', 'y', 'z')
    .filter(col('x') < 4, col('y') > 1)
    .arrange(desc('z'), 'x')
    .mutate(double_x = col('x') * 2,
            x_plus_y = col('x') + col('y'))
)
┌─────┬─────┬─────┬──────────┬──────────┐
│ x   ┆ y   ┆ z   ┆ double_x ┆ x_plus_y │
│ --- ┆ --- ┆ --- ┆ ---      ┆ ---      │
│ i64 ┆ i64 ┆ str ┆ i64      ┆ i64      │
╞═════╪═════╪═════╪══════════╪══════════╡
│ 2   ┆ 5   ┆ b   ┆ 4        ┆ 7        │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 0   ┆ 3   ┆ a   ┆ 0        ┆ 3        │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 1   ┆ 4   ┆ a   ┆ 2        ┆ 5        │
└─────┴─────┴─────┴──────────┴──────────┘

The key difference from R is that column names must be wrapped in col() in the following methods:

.filter()
.mutate()
.summarize()

The general idea - when doing calculations on a column you need to wrap it in col(). When doing simple column selections (like in .select()) you can pass the column names as strings.

Group by syntax

Methods operate by group by calling the by arg.

A single column can be passed with by = 'z'
Multiple columns can be passed with by = ['y', 'z']

(
    df
    .summarize(avg_x = tp.mean(col('x')),
               by = 'z')
)
┌─────┬───────┐
│ z   ┆ avg_x │
│ --- ┆ ---   │
│ str ┆ f64   │
╞═════╪═══════╡
│ a   ┆ 0.5   │
├╌╌╌╌╌┼╌╌╌╌╌╌╌┤
│ b   ┆ 2     │
└─────┴───────┘

Selecting/dropping columns

tidyselect functions can be mixed with normal selection when selecting columns:

df = tp.Tibble(x1 = range(3), x2 = range(3), y = range(3), z = range(3))

df.select(tp.starts_with('x'), 'z')
┌─────┬─────┬─────┐
│ x1  ┆ x2  ┆ z   │
│ --- ┆ --- ┆ --- │
│ i64 ┆ i64 ┆ i64 │
╞═════╪═════╪═════╡
│ 0   ┆ 0   ┆ 0   │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┤
│ 1   ┆ 1   ┆ 1   │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┤
│ 2   ┆ 2   ┆ 2   │
└─────┴─────┴─────┘

To drop columns use the .drop() method:

df.drop(tp.starts_with('x'), 'z')
┌─────┐
│ y   │
│ --- │
│ i64 │
╞═════╡
│ 0   │
├╌╌╌╌╌┤
│ 1   │
├╌╌╌╌╌┤
│ 2   │
└─────┘

Converting to/from pandas data frames

If you need to use a package that requires pandas data frames, you can convert from a tidypolars Tibble to a pandas DataFrame.

To do this you'll first need to install pyarrow:

pip3 install pyarrow

To convert to a pandas DataFrame:

df = df.to_pandas()

To convert from a pandas DataFrame to a tidypolars Tibble:

df = tp.from_pandas(df)

Speed Comparisons

A few notes:

Comparing times from separate functions typically isn't very useful. For example - the .summarize() tests were performed on a different dataset from .pivot_wider().
All tests are run 5 times. The times shown are the median of those 5 runs.
All timings are in milliseconds.
All tests can be found in the source code here.
FAQ - Why are some tidypolars functions faster than their polars counterpart?
- Short answer - they're not! After all they're just using polars in the background.
- Long answer - All python functions have some slight natural variation in their execution time. By chance the tidypolars runs were slightly shorter on those specific functions on this iteration of the tests. However one goal of these tests is to show that the "time cost" of translating syntax to polars is very negligible to the user (especially on medium-to-large datasets).
Lastly I'd like to mention that these tests were not rigorously created to cover all angles equally. They are just meant to be used as general insight into the performance of these packages.

┌─────────────┬────────────┬─────────┬──────────┐
│ func_tested ┆ tidypolars ┆ polars  ┆ pandas   │
│ ---         ┆ ---        ┆ ---     ┆ ---      │
│ str         ┆ f64        ┆ f64     ┆ f64      │
╞═════════════╪════════════╪═════════╪══════════╡
│ arrange     ┆ 190.345    ┆ 169.478 ┆ 500.112  │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ case_when   ┆ 87.348     ┆ 79.427  ┆ 152.623  │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ distinct    ┆ 16.888     ┆ 16.282  ┆ 28.725   │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ filter      ┆ 29.789     ┆ 29.91   ┆ 231.397  │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ full_join   ┆ 236.784    ┆ 231.283 ┆ 1042.689 │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ inner_join  ┆ 49.71      ┆ 47.563  ┆ 630.98   │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ left_join   ┆ 113.792    ┆ 115     ┆ 1100.607 │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ mutate      ┆ 7.979      ┆ 7.408   ┆ 117.283  │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ pivot_wider ┆ 42.764     ┆ 39.939  ┆ 49.048   │
├╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ summarize   ┆ 59.434     ┆ 58.011  ┆ 453.707  │
└─────────────┴────────────┴─────────┴──────────┘

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

Comments

`drop` with error `RuntimeError: Any(NotFound("^x.*$"))`

import sys
import tidypolars as tp
sys.version
# '3.9.7 (default, Sep 16 2021, 13:09:58) \n[GCC 7.5.0]'
tp.__version__
# '0.2.1'
## error
df = tp.Tibble(x1 = range(3), x2 = range(3), y=range(3), z = range(3))
df.drop([tp.starts_with('x'), 'z'])
df.drop()
`
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
/tmp/ipykernel_12815/866601321.py in <module>
----> 1 df.drop(tp.starts_with('x'))

~/miniconda3/envs/py39/lib/python3.9/site-packages/polars/eager/frame.py in drop(self, name)
   2253             return df
   2254 
-> 2255         return wrap_df(self._df.drop(name))
   2256 
   2257     def drop_in_place(self, name: str) -> "pl.Series":

RuntimeError: Any(NotFound("^x.*$"))
`

opened by ztsweet 9

`AttributeError: arrange not found`

import tidypolars as tp
from tidypolars import col, desc
import sys
sys.version
# '3.10.0 | packaged by conda-forge | (default, Oct 12 2021, 21:24:52) [GCC 9.4.0]'
tp.__version__
# '0.2.1'
df = tp.Tibble({'x': ['a', 'a', 'b'], 'y': range(3)})
df.arrange('x', 'y')
`
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
~/miniconda3/envs/py310/lib/python3.10/site-packages/polars/eager/frame.py in __getattr__(self, item)
    882         try:
--> 883             return pl.eager.series.wrap_s(self._df.column(item))
    884         except RuntimeError:

RuntimeError: Any(NotFound("arrange"))

During handling of the above exception, another exception occurred:

AttributeError                            Traceback (most recent call last)
/tmp/ipykernel_21110/1194586334.py in <module>
----> 1 df.arrange('x', 'y')

~/miniconda3/envs/py310/lib/python3.10/site-packages/polars/eager/frame.py in __getattr__(self, item)
    883             return pl.eager.series.wrap_s(self._df.column(item))
    884         except RuntimeError:
--> 885             raise AttributeError(f"{item} not found")
    886 
    887     def __iter__(self) -> Iterator[Any]:

AttributeError: arrange not found
`

bug

opened by ztsweet 6

Missing attributes when chaining

Hi Mark, thanks for putting this package together. It looks very cool.

I'm having a tough time getting the motivating examples to work, though. For example, the following triggers an error:

import tidypolars as tp
from tidypolars import col, desc

df = tp.Tibble(x = range(3), y = range(3, 6), z = ['a', 'a', 'b'])

df.filter(col('x') < 2).arrange(desc('z'), 'x')

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
Untitled-1 in <cell line: 1>()
----> <a href='untitled:Untitled-1?line=7'>8</a> df.filter(col('x') < 2).arrange(desc('z'), 'x')

AttributeError: 'DataFrame' object has no attribute 'arrange'

What's genuinely odd about the above is that arrange works on its own and when it comes before filter.

# All of these work as expected
df.filter(col('x') < 2)
df.arrange(desc('z'), 'x')
df.arrange(desc('z'), 'x').filter(col('x') < 2)

A seemingly related issue is that I can't pass two arguments to filter when it follows arrange (or other verbs likeselect for that matter).

df.filter(col('x') < 2, col('y')>3) ## works
df.arrange(desc('z'), 'x').filter(col('x') < 2, col('y')>3) ## errors with "filter() takes 2 positional arguments but 3 were given"

Any ideas?

I'm on Python 3.9.2 installed via Homebrew on a 2019 Macbook (so regular Intel chip) and running the latest version of tidypolars (0.2.15).

bug

opened by grantmcdermott 5

Is it possible to have dplyr's `group_by` + `mutate` behavior?
First of all, I really like this package and I've started to use it a lot in my work. As a Pythonista whose first language is R, I really enjoy tidypolars.

In R, we can do something like the following

library(dplyr) data(iris) iris %>% group_by(Species) %>% mutate( result = Petal.Width - mean(Petal.Width) )

Since we have a group_by(Species) call, dplyr will subtract the mean that corresponds to each group in the mutate() operation (not the mean across all observations from all species).

As far as I understand, this is still not possible with tidypolars since we don't have a group_by function that behaves in a similar way to the one in dplyr. So my questions are

Is it possible to have this behavior in tidypolars now?

If yes, how?

If not, is it going to be possible? I could volunteer to try to implement it. I'm not familiar with the existing codebase, but I suspect that Python eager evaluation of function arguments is what makes it harder to have such a feature?

Again, thanks for the fantastic library!
opened by tomicapretto 5

idiomatic way to add list as column

Forgive what's probably a dumb question, but is there a way to get .mutate to return the same object as the .bind_cols line?

import tidypolars as tp

tb = tp.Tibble({'a': [1, 2, 3]})
x = [4, 5, 6]
# gives desired output 
tb.bind_cols(tp.Tibble({'b': x}))
# gives error: ValueError: could not convert value '[4, 5, 6]' as a Literal
tb.mutate(b = x)

feature

opened by eutwt 4

purrr functions!?

I noticed that in tidytable, you have purrr functions like map.(), but not in tidypolars.

Using for loops + lambda functions are just not desirable for collaborative coding / code readability/comprehension. In Python, even if there is a bit of sacrifice in performance, if it allows better code readability, it would be really nice to have.

Would something like map.() be in the scope of this repo?
feature

opened by exsell-jc 3

```ValueError``` with ```filter```

When I chain filter expressions with | (error message said to use | and not or), I receive a ValueError message:

tp.Tibble(chr_col = tp.Series(['this is a test 1', 'this is a test 2', 'this is a test 3']))\
    .filter(col('chr_col') == 'this is a test 1' |
            col('chr_col') == 'this is a test 2')

ValueError: Since Expr are lazy, the truthiness of an Expr is ambiguous. 
Hint: use '&' or '|' to chain Expr together, not and/or.

It works fine if I do one or the other:

tp.Tibble(chr_col = tp.Series(['this is a test 1', 'this is a test 2', 'this is a test 3']))\
    .filter(# col('chr_col') == 'this is a test 1' |
            col('chr_col') == 'this is a test 2')

# chr_col
#   --
#   str
# "this is a test 2"

tp.Tibble(chr_col = tp.Series(['this is a test 1', 'this is a test 2', 'this is a test 3']))\
    .filter(col('chr_col') == 'this is a test 1' # |
            # col('chr_col') == 'this is a test 2'
            )

# chr_col
#   --
#   str
# "this is a test 1"

opened by alexandro-ag 3

```as_date``` with ```RuntimeError: please define a fmt```
Good afternoon,

I think I found an issue with the as_date method. In the example per the documentation, the following succeeds:

import tidypolars as tp from tidypolars import col date_df = tp.Tibble(date = ['2021-12-31']) # Year-Month-Day (%Y-%m-%d) date_df.mutate(date_parsed = tp.as_date(col('date'))) # Success

However when parsing different formats (using the fmt argument), the date fails to parse:

import tidypolars as tp from tidypolars import col date_df = tp.Tibble(date = ['12/31/2021']) # Month/Day/Year (%m/%d/%Y) date_df.mutate(date_parsed = tp.as_date(col('date'), fmt='%m/%d/%Y')) # RuntimeError

I also extend my appreciation for all the work on this package. I've been searching for a tidyverse implementation in python and this one knocks my expectations out of the park. Thank you.
opened by alexandro-ag 3
Revisit `.rename()` syntax

Should the syntax be the same as pl.DataFrame.rename? Currently polars mimics pandas syntax. Or should it be something that attempts to mimic tidyverse syntax?

Note: polars also has a .rename_col() with syntax df.rename_col('old', 'new').

opened by markfairbanks 3
Compatibility with polars v0.14.0

PR that caused the break: https://github.com/pola-rs/polars/pull/4309

Old behavior that tidypolars relied on: https://github.com/pola-rs/polars/pull/2862
feature

opened by markfairbanks 2

Basics: tp.read_csv(), df.drop(x1, x2, x3, ...), and df.colnames?

Really new to the library, but looking at the documentation did not really help with understanding.

Problem 1

import polars as pl
import tidypolars as tp
import csv
import requests

url = f'https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-05-24/Scrumqueens-data-2022-05-23.csv'

df = tp.read_csv(file = url) # does not work
df = pl.read_csv(file = url) # works??

Problem 2

df = df.drop('...1', 'Notes') # does not work
df = df.drop('...1') # works separately
df = df.drop('Notes') # works separately

Problem 3

df.colnames
df.names
df.colnames()
df.names()
# None of these work

What am I missing, exactly?

opened by exsell-jc 2

plans for adding type hints

Hi, it seems that the codebase is not annotated making the discoverability of methods difficult and static code analysis not working. Any plans on adding type hints?
feature

opened by mr-majkel 1
`write_csv()` returns `'super' object has no attribute 'to_csv'`
Hi, There seems to be a problem with write_csv(). I can import tidypolars and the data just fine:

import tidypolars as tp rents = tp.read_csv("https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2022/2022-07-05/rent.csv")

But when I try to export the data frame as a csv file:

rents.write_csv("rents.csv")

I get an error stating 'super' object has no attribute 'to_csv'.

The data come from the Tidytuesday repo. Python version is 3.10.8 and tidypolars is 0.2.19. I'm on macOS 13.
bug
opened by alesvomacka 1
Calculating time
In R with lubridate, it would look like this:

one_year_before = some_date - years(1) one_year_before = some_date - months(12)

But in tidypolars functions list, there doesn't seem to be a years or months function: https://tidypolars.readthedocs.io/en/latest/reference.html
feature
opened by exsell-jc 4

Releases(v0.2.19)

v0.2.19(Oct 10, 2022)
New functions

make_date()

make_datetime()

Source code(tar.gz)
Source code(zip)
v0.2.18(Oct 10, 2022)

polars >=0.14.18 compatibility
Source code(tar.gz)
Source code(zip)
v0.2.15(Apr 22, 2022)
Added support for python 3.7

Source code(tar.gz)
Source code(zip)
v0.2.14(Apr 22, 2022)

Source code(tar.gz)
Source code(zip)
v0.2.13(Apr 7, 2022)
New functions

cor()

cov()

log()

log10()

rep()

var()

Methods with notable speed improvements

.separate()

Source code(tar.gz)
Source code(zip)
v0.2.12(Apr 7, 2022)
New Tibble methods

.unite()

Source code(tar.gz)
Source code(zip)
v0.2.11(Feb 11, 2022)
New functions

across()

as_boolean()

Functionality improvements

Can pass an empty list to by

.mutate()

Column expressions are evaluated sequentially in order to match dplyr semantics

Can add a new column with a constant without tp.lit()

Source code(tar.gz)
Source code(zip)
v0.2.10(Feb 8, 2022)
New Tibble methods

.separate()

New functions

coalesce()

n()

row_number()

str_c()

str_ends()

str_starts()

Source code(tar.gz)
Source code(zip)
v0.2.9(Feb 6, 2022)

Source code(tar.gz)
Source code(zip)
v0.2.8(Dec 8, 2021)
Bug fixes

Can use fmt arg in as_date() and as_datetime() (#155)

Source code(tar.gz)
Source code(zip)
v0.2.7(Nov 19, 2021)
New Tibble methods

.to_dict()

Source code(tar.gz)
Source code(zip)
v0.2.6(Nov 18, 2021)
New functions

count()

floor()

length()

quantile()

sqrt()

Functionality improvements

.bind_rows(): Auto-aligns columns by name

Source code(tar.gz)
Source code(zip)
v0.2.5(Nov 16, 2021)

Source code(tar.gz)
Source code(zip)
v0.2.4(Nov 16, 2021)
New functions

paste()

paste0()

Improved functionality

.relocate(): tidyselect helpers work

Source code(tar.gz)
Source code(zip)
v0.2.1(Nov 8, 2021)
New Tibble methods

.replace_null()

.set_names()

New functions

replace_null()

Source code(tar.gz)
Source code(zip)
v0.2.0(Nov 5, 2021)
v0.2.0 (2021/11/05)

New Functions

as_float()

as_integer()

as_string()

between()

cast()

desc()

is_finite()

is_in()

is_infinite()

is_not()

is_not_in()

is_not_null()

is_null()

round()

lubridate

as_date()

as_datetime()

dt_round()

hour()

mday()

minute()

month()

quarter()

second()

wday()

week()

yday()

year()

stringr

str_detect()

str_extract()

str_length()

str_remove_all()

str_remove()

str_replace_all()

str_replace()

str_sub()

str_to_lower()

str_to_upper()

str_trim()

Improved functionality

.drop(): tidyselect helpers work

Source code(tar.gz)
Source code(zip)
v0.1.7(Oct 20, 2021)
New Tibble methods

.count()

.drop_null()

.inner_join()/.left_join()/.outer_join()

.write_csv()

.write_parquet()

New functions

tp.abs()

tp.case_when()

tp.first()

tp.if_else()

tp.lag()

tp.last()

tp.lead()

tp.max()

tp.mean()

tp.median()

tp.min()

tp.n_distinct()

tp.sd()

tp.sum()

tp.read_csv()

tp.read_parquet()

tidyselect

tp.contains()

tp.ends_with()

tp.everything()

tp.starts_with()

Improved functionality

.bind_cols()/.bind_rows(): Can append multiple data frames in one call

Source code(tar.gz)
Source code(zip)
v0.1.6(Oct 15, 2021)
Improved functionality

.rename(): Can now use both a dplyr-like and pandas-like interface

New attributes

.names

.ncol

.nrow

Source code(tar.gz)
Source code(zip)
v0.1.5(Oct 12, 2021)
New Tibble methods

.fill()

.head()

.pivot_longer()

.pivot_wider()

.tail()

.slice_head()

.slice_tail()

New expression methods

.lag()

.lead()

Source code(tar.gz)
Source code(zip)
v0.1.4(Oct 7, 2021)
New methods

.bind_cols()

.bind_rows()

.distinct()

.pull()

.rename()

.slice()

Source code(tar.gz)
Source code(zip)
v0.1.3(Oct 4, 2021)

Source code(tar.gz)
Source code(zip)

Owner

Mark Fairbanks

GitHub Repository http://tidypolars.readthedocs.io

RTSeg: Real-time Semantic Segmentation Comparative Study

Real-time Semantic Segmentation Comparative Study The repository contains the official TensorFlow code used in our papers: RTSEG: REAL-TIME SEMANTIC S

592 Nov 18, 2022

Github project for Attention-guided Temporal Coherent Video Object Matting.

Attention-guided Temporal Coherent Video Object Matting This is the Github project for our paper Attention-guided Temporal Coherent Video Object Matti

71 Dec 19, 2022

A library for implementing Decentralized Graph Neural Network algorithms.

decentralized-gnn A package for implementing and simulating decentralized Graph Neural Network algorithms for classification of peer-to-peer nodes. De

5 Nov 07, 2022

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

1.7k Dec 28, 2022

Normalizing Flows with a resampled base distribution

Resampling Base Distributions of Normalizing Flows Normalizing flows are a popular class of models for approximating probability distributions. Howeve

24 Nov 03, 2022

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

Stock Market Buy/Sell/Hold prediction Using convolutional Neural Network This repo is an attempt to implement the research paper titled "Algorithmic F

136 Dec 28, 2022

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data Christoph Reich, Tim Prangemeier, Özdemir Cetin & Heinz Koeppl | Pr

23 Sep 21, 2022

Computer vision - fun segmentation experience using classic and deep tools :)

Computer_Vision_Segmentation_Fun Segmentation of Images and Video. Tools: pytorch Models: Classic model - GrabCut Deep model - Deeplabv3_resnet101 Flo

1 Dec 18, 2021

Built a deep neural network (DNN) that functions as an end-to-end machine translation pipeline

Built a deep neural network (DNN) that functions as an end-to-end machine translation pipeline. The pipeline accepts english text as input and returns the French translation.

1 Jan 24, 2022

Accelerating BERT Inference for Sequence Labeling via Early-Exit

Sequence-Labeling-Early-Exit Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit Requirement: Please refer to re

23 Oct 14, 2022

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing Figure: Joint multi-attribute edits using DyStyle model. Great diversity

74 Dec 03, 2022

A library for efficient similarity search and clustering of dense vectors.

Faiss Faiss is a library for efficient similarity search and clustering of dense vectors. It contains algorithms that search in sets of vectors of any

18.8k Jan 08, 2023

A Pytorch Implementation for Compact Bilinear Pooling.

CompactBilinearPooling-Pytorch A Pytorch Implementation for Compact Bilinear Pooling. Adapted from tensorflow_compact_bilinear_pooling Prerequisites I

169 Dec 23, 2022

Evaluation framework for testing segmentation networks in PyTorch

Evaluation framework for testing segmentation networks in PyTorch. What segmentation network to choose for next Kaggle competition? This benchmark knows the answer!

37 Apr 27, 2022

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs This is the code of paper ConE: Cone Embeddings for Multi-Hop Reasoning over Knowl

33 Dec 07, 2022

Sequence to Sequence (seq2seq) Recurrent Neural Network (RNN) for Time Series Forecasting

Sequence to Sequence (seq2seq) Recurrent Neural Network (RNN) for Time Series Forecasting Note: You can find here the accompanying seq2seq RNN forecas

1k Dec 25, 2022

Complete the code of prefix-tuning in low data setting

Prefix Tuning Note: 作者在论文中提到使用真实的word去初始化prefix的操作（Initializing the prefix with activations of real words，significantly improves generation）。我在使用作者提供的

4 Jul 11, 2022

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

LILA LILA: Language-Informed Latent Actions Code and Experiments for Language-Informed Latent Actions (LILA), for using natural language to guide assi

11 Nov 25, 2022

Text mining project; Using distilBERT to predict authors in the classification task authorship attribution.

DistilBERT-Text-mining-authorship-attribution Dataset used: https://www.kaggle.com/azimulh/tweets-data-for-authorship-attribution-modelling/version/2

1 Jan 13, 2022

Poisson Surface Reconstruction for LiDAR Odometry and Mapping

Poisson Surface Reconstruction for LiDAR Odometry and Mapping Surfels TSDF Our Approach Table: Qualitative comparison between the different mapping te

305 Dec 21, 2022

Tidy interface to polars

Related tags

Overview

tidypolars

Installation

General syntax

Group by syntax

Selecting/dropping columns

Converting to/from pandas data frames

Speed Comparisons

Contributing

Comments

Releases(v0.2.19)

v0.2.19(Oct 10, 2022)

New functions

v0.2.18(Oct 10, 2022)

v0.2.15(Apr 22, 2022)

v0.2.14(Apr 22, 2022)

v0.2.13(Apr 7, 2022)

New functions

Methods with notable speed improvements

v0.2.12(Apr 7, 2022)

New Tibble methods

v0.2.11(Feb 11, 2022)

New functions

Functionality improvements

v0.2.10(Feb 8, 2022)

New Tibble methods

New functions

v0.2.9(Feb 6, 2022)

v0.2.8(Dec 8, 2021)

Bug fixes

v0.2.7(Nov 19, 2021)

New Tibble methods

v0.2.6(Nov 18, 2021)

New functions

Functionality improvements

v0.2.5(Nov 16, 2021)

v0.2.4(Nov 16, 2021)

New functions

Improved functionality

v0.2.1(Nov 8, 2021)

New Tibble methods

New functions

v0.2.0(Nov 5, 2021)

v0.2.0 (2021/11/05)

New Functions

Improved functionality

v0.1.7(Oct 20, 2021)

v0.1.6(Oct 15, 2021)

v0.1.5(Oct 12, 2021)

v0.1.4(Oct 7, 2021)

v0.1.3(Oct 4, 2021)

Owner

Mark Fairbanks

RTSeg: Real-time Semantic Segmentation Comparative Study

Github project for Attention-guided Temporal Coherent Video Object Matting.

A library for implementing Decentralized Graph Neural Network algorithms.

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Normalizing Flows with a resampled base distribution

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

Computer vision - fun segmentation experience using classic and deep tools :)

Built a deep neural network (DNN) that functions as an end-to-end machine translation pipeline

Accelerating BERT Inference for Sequence Labeling via Early-Exit

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

A library for efficient similarity search and clustering of dense vectors.

A Pytorch Implementation for Compact Bilinear Pooling.

Evaluation framework for testing segmentation networks in PyTorch

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Sequence to Sequence (seq2seq) Recurrent Neural Network (RNN) for Time Series Forecasting

Complete the code of prefix-tuning in low data setting

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

Text mining project; Using distilBERT to predict authors in the classification task authorship attribution.

Poisson Surface Reconstruction for LiDAR Odometry and Mapping