当前位置:网站首页>Kettle [practice 02] TXT type files are classified and imported, execute SQL to convert data types and put them into storage (complete process instance cloud resource sharing: including sql+kjb+ktr+ t
Kettle [practice 02] TXT type files are classified and imported, execute SQL to convert data types and put them into storage (complete process instance cloud resource sharing: including sql+kjb+ktr+ t
2022-07-18 12:16:00 【Kite】
The resource link is at the end of the article , Include files :
1. Requirement specification
An application related to administrative division boundary data , Administrative division data from Hydrographic micrograph Download from , You need to import this data into the database GreenPlum, And standardize the data format . There are two types of source data :
- Area _ spot .txt( Area name and center point coordinates )
- Area _ Noodles .txt( Area name and area boundary point coordinates )
The data needs to be converted into two types :
- spot
POINT(longitude,latitude) - polygon
POLYGON( POINT, POINT, POINT, ... )
2. Process instance
2.1 Point and face file data warehousing
The data cleaning process of point and surface is the same , Here we use surface data to illustrate , First of all Filter all face data files under the source data folder :

Parse the contents of the file , Here are some points to pay attention to :
- File name field
边栏推荐
- Redis data structure practice, see how microblogging, wechat, shopping cart, lottery applet is used?
- 三個步驟,一天就搞定了MySQL,讓我順利拿下了天猫offer
- 面试高频:MySQL是怎么保证高可用的?
- Installing MySQL on Linux
- GDB or delve debug Go program, check variable display < optimized out > solution
- Strings containing numbers are eliminated and letters are incremented according to the step size
- AcWing 368. Galaxy problem solution (strongly connected components as difference constraints)
- Transformation of brushless motor on disk
- Map set to object, map field has horizontal lines, object to map
- 传统健身房困于规模化,乐刻运动“S2B2C”模式成参考答案?
猜你喜欢

Statistics of Top100 domestic NFT Platform Alliance chain and public chain usage

Traditional gyms are trapped in large-scale, and Lexus sports "s2b2c" mode is the reference answer?

openEuler 知:ip addr 查不到 ip 的解决方法

【Jailhouse 文章】Bao: A Lightweight Static Partitioning Hypervisor for Modern Multi-Core Embedded...

Binary tree traversal

Why do consumers buy iPhones instead of domestic ones? Because the depreciation of domestic mobile phones is too fast

面试高频:MySQL是怎么保证高可用的?

Overview of database system -- overview of data model

逆向学习笔记(一)

Introduction to Chang'an chain-01
随机推荐
Installing MySQL on Linux
C# 使用JObject解析嵌套json
自动推理的逻辑02-命题微积分
Why do consumers buy iPhones instead of domestic ones? Because the depreciation of domestic mobile phones is too fast
洛谷题单-高精度
Transfer learning model training II
Mockito中模拟静态方法
vue+mysql连接数据库实现登录注册
力扣(LeetCode)196. 删除重复的电子邮箱(2022.07.15)
这20道Redis经典面试题你还不会,面试官看都不看你一眼
第50篇-某查查请求头参数分析【2022-07-14】
Jupyterab installation
GDB common instructions
Solution to script loss when the prefab of unity ngui is migrated to other projects
长安链介绍-02
接口自动化测试:Postman实战教程
使用nlmeas对图像进行去噪
Traditional gyms are trapped in large-scale, and Lexus sports "s2b2c" mode is the reference answer?
Rust语言——小小白的入门学习10
Differences among screenwidth, clientwidth, offsetwidth, and scrollwidth