Working Time Statistics of working hours and working conditions by industry and company

Last update: Nov 04, 2022

Related tags

Data Analysis worktime

Overview

Working Time

统计各行业，各公司工作时间与工作条件。

原始数据来源：https://github.com/WorkerLivesMatter/WorkingTime，向发起人致敬。

经过少量处理，整理为供PostgreSQL直接可以使用的数据表。

Public Demo: http://demo.pigsty.cc/d/worktime-query

如何使用?

如果你已经有了pigsty环境, 使用管理用户在管理节点上克隆本项目并执行 make all 即可

git clone https://github.com/Vonng/worktime && cd worktime
make all

数据说明

数据模式位于：sql/000_base.sql
数据内容位于：data/worktime.csv
原始数据文件：data/raw.xlsx，可以从 WorkerLivesMatter/WorkingTime 下载。

CREATE TABLE worktime.worktime
(
    id          INTEGER NOT NULL,
    company     TEXT,
    department  TEXT,
    job         TEXT,
    base        TEXT,
    work_begin  TEXT,
    work_end    TEXT,
    launch_time TEXT,
    dinner_time TEXT,
    wed         TEXT,
    fri         TEXT,
    workdays    TEXT,
    summary     TEXT,
    remark      TEXT,
    category    TEXT,
    suggestion  TEXT,
    struct      TEXT,
    welfare     TEXT,
    is_foreign  BOOLEAN,
    domain      TEXT NOT NULL
) partition by list (domain);

CREATE TABLE worktime.internet PARTITION OF worktime.worktime FOR VALUES IN ('互联网');
CREATE TABLE worktime.finance  PARTITION OF worktime.worktime FOR VALUES IN ('金融');
CREATE TABLE worktime.foreign  PARTITION OF worktime.worktime FOR VALUES IN ('外企');
CREATE TABLE worktime.misc     PARTITION OF worktime.worktime FOR VALUES IN ('其他');

COMMENT ON TABLE worktime.worktime IS '企业工作时间统计表';
COMMENT ON TABLE worktime.internet IS '企业工作时间统计表：互联网行业';
COMMENT ON TABLE worktime.finance IS '企业工作时间统计表：金融行业';
COMMENT ON TABLE worktime.foreign IS '企业工作时间统计表：外企';
COMMENT ON TABLE worktime.misc IS '企业工作时间统计表：其他';

CREATE INDEX ON worktime.worktime(company, department);
COMMENT ON COLUMN worktime.worktime.id IS '原始数据行号';
COMMENT ON COLUMN worktime.worktime.company IS '公司';
COMMENT ON COLUMN worktime.worktime.department IS '部门';
COMMENT ON COLUMN worktime.worktime.job IS '岗位';
COMMENT ON COLUMN worktime.worktime.base IS 'base地';
COMMENT ON COLUMN worktime.worktime.work_begin IS '上班时间';
COMMENT ON COLUMN worktime.worktime.work_end IS '下班时间';
COMMENT ON COLUMN worktime.worktime.launch_time IS '午饭时间';
COMMENT ON COLUMN worktime.worktime.dinner_time IS '晚饭时间';
COMMENT ON COLUMN worktime.worktime.wed IS '周三是否特殊';
COMMENT ON COLUMN worktime.worktime.fri IS '周五是否特殊';
COMMENT ON COLUMN worktime.worktime.workdays IS '一周工作天数';
COMMENT ON COLUMN worktime.worktime.summary IS '新人是否日报/周报';
COMMENT ON COLUMN worktime.worktime.remark IS '备注';
COMMENT ON COLUMN worktime.worktime.category IS '行业/公司性质';
COMMENT ON COLUMN worktime.worktime.suggestion IS '建议';
COMMENT ON COLUMN worktime.worktime.struct IS '组内 35 岁及以上基层员工（ 组长及以下）比例，格式为 x / y，x 为 35岁以上的人数，y 为总人数';
COMMENT ON COLUMN worktime.worktime.welfare IS '是否有其他福利（如：五险一金，带薪年假，公费旅游，免费三餐）';
COMMENT ON COLUMN worktime.worktime.is_foreign IS '是否为外资企业？';
COMMENT ON COLUMN worktime.worktime.domain IS '大分类：互联网、金融、外企、其他';

Working Time Statistics of working hours and working conditions by industry and company

Related tags

Overview

Working Time

如何使用?

数据说明

Owner

Feng Ruohang

GWpy is a collaboration-driven Python package providing tools for studying data from ground-based gravitational-wave detectors

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code

A notebook to analyze Amazon Recommendation Review Dataset.

.npy, .npz, .mtx converter.

A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow

Single-Cell Analysis in Python. Scales to >1M cells.

📊 Python Flask game that consolidates data from Nasdaq, allowing the user to practice buying and selling stocks.

A tax calculator for stocks and dividends activities.

Data collection, enhancement, and metrics calculation.

CRISP: Critical Path Analysis of Microservice Traces

A stock analysis app with streamlit

COVID-19 deaths statistics around the world

MeSH2Matrix - A set of Python codes for the generation of biomedical ontologies from the MeSH keywords of the PubMed scholarly publications

Integrate bus data from a variety of sources (batch processing and real time processing).

Aggregating gridded data (xarray) to polygons

OpenDrift is a software for modeling the trajectories and fate of objects or substances drifting in the ocean, or even in the atmosphere.

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Full automated data pipeline using docker images

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

Full ELT process on GCP environment.