Spark Core core design: RDD (220716)
2022-07-19 03:47:00 【Ah, six six six】

Port already in use.
Chained programming (method chaining).



If this feature is missing: installing the Professional edition is recommended; ask me for the installation package.
Upload.
Remember: after modifying code on Windows, be sure to synchronize it.
When the program runs on a distributed cluster, its printed output also ends up on the cluster.
Go to the logs to see it.
The cluster is used to test whether there is any problem in the cluster environment.
RDD functions run in the Executor.
Their output is printed in the Executor's run log.
Executors run on Worker nodes,
so the output is printed in the Worker logs.
1- Output printed in the Driver: we can see it.
2- Output printed in an Executor: we cannot see it directly (it can be seen at port 18080, under stdout).
Can all output be seen in local mode? Is that because local mode has only a single Driver process?
Printing inside RDD operators:
print: runs in the Driver.
Operators are executed by Tasks; Tasks run in Executors; Executors run on Worker nodes.
sbin: cluster management commands
bin: client commands
Requirement: use a client command to submit the program to the cluster to run.
pyspark: Python command-line client
spark-sql/beeline: clients for submitting SQL
spark-submit: client for submitting Python files
argv

Each program has only one Driver.
kill and status can be shown in the monitoring UI.
0- command, 1- options, 2- file, 3- parameters
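
As a rough sketch of the four-part layout above (command, options, file, parameters), the snippet below assembles a spark-submit command line and shows how the submitted script might read its parameters from argv. The helper names build_submit_command and parse_app_args and the two paths /example/input and /example/output are hypothetical, chosen only for illustration; the --master yarn option is just one possible choice.

```python
import shlex


def build_submit_command(options, app_file, app_args):
    """Assemble a command line in the order: command, options, file, parameters."""
    command = ["spark-submit"]              # 0 - command
    for flag, value in options:             # 1 - options
        command.extend([flag, value])
    command.append(app_file)                # 2 - file
    command.extend(app_args)                # 3 - parameters
    return command


def parse_app_args(argv):
    """Read two parameters from argv, the way the submitted script would.

    argv[0] is the script itself; the parameters start at argv[1].
    """
    if len(argv) != 3:
        raise SystemExit("usage: <file> <input_path> <output_path>")
    return argv[1], argv[2]


app_file = "hdfs://node1:8020/export/data/pyspark_core_word_args.py"
app_args = ["/example/input", "/example/output"]  # hypothetical paths

cmd = build_submit_command([("--master", "yarn")], app_file, app_args)
print(shlex.join(cmd))

# Inside the script, the same parameters would arrive through sys.argv.
input_path, output_path = parse_app_args([app_file] + app_args)
print(input_path, output_path)
```

Everything after the file is passed to the program untouched, which is why the options have to come before it.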

1-Master, 2-Worker, 3-Client
Where does the Driver process run?
Why is output printed in the Driver displayed?

Local mode: visible at 18080, not visible at 8080.
0- command, 1- options, 2- file, 3- parameters

hdfs://node1:8020/export/data/pyspark_core_word_args.py
In the YARN UI, a Spark program links directly to 18080.
In the YARN UI, an MR program links directly to 19888.
Submitting to YARN: port 8032.
Startup depends on the resources of the slave nodes: as many are started as the resources allow.

Where does the Driver run?

Only the Driver assigns Tasks.
Executors register back with the Driver (reverse registration).

The AppMaster runs only on a slave node.
Development: analyze and process the data.
Plain Python runs on a single node.

DataFrame: a data table, that is, the data plus the structure of the table.

A transformation on an RDD is essentially a parallel operation over all of the RDD's partitions.
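
To make the statement above concrete, here is a minimal pure-Python simulation, not the Spark API: a list is split into partitions and the same function is applied to every partition by its own worker, much as one Task handles one partition. The helpers split_into_partitions and map_partitions are hypothetical names, and a thread pool stands in for the Executors.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(data, num_partitions):
    """Split a list into contiguous slices (a stand-in for RDD partitions)."""
    size, extra = divmod(len(data), num_partitions)
    partitions, start = [], 0
    for i in range(num_partitions):
        end = start + size + (1 if i < extra else 0)
        partitions.append(data[start:end])
        start = end
    return partitions


def map_partitions(partitions, func):
    """Apply func to every element, using one worker ("task") per partition."""
    def run_task(partition):
        return [func(x) for x in partition]

    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        # pool.map returns results in the same order as the input partitions.
        return list(pool.map(run_task, partitions))


partitions = split_into_partitions(list(range(1, 10)), 3)
result = map_partitions(partitions, lambda x: x * 10)
print(partitions)
print(result)
```

No element ever leaves its partition here, which is what lets the three workers run without coordinating with each other.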

A file is a logical concept; physically it is stored as blocks.

3 partitions, 3 Tasks.
Global grouping requires a shuffle.
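
The need for a shuffle can be sketched in plain Python, again as a simulation rather than Spark's actual implementation: records with the same key may start out in different partitions, so grouping them globally means routing every record to a partition chosen by its key. The function names and the use of zlib.crc32 as the partitioning hash are assumptions made for this example.

```python
import zlib
from collections import defaultdict


def partition_for(key, num_partitions):
    """Pick a target partition from the key with a deterministic hash."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def shuffle_group_by_key(partitions, num_partitions):
    """Move every (key, value) record to the partition that owns its key."""
    shuffled = [defaultdict(list) for _ in range(num_partitions)]
    for partition in partitions:
        for key, value in partition:
            shuffled[partition_for(key, num_partitions)][key].append(value)
    return [dict(p) for p in shuffled]


# The same word appears in several input partitions.
input_partitions = [
    [("spark", 1), ("hadoop", 1)],
    [("spark", 1), ("hive", 1)],
    [("hadoop", 1), ("spark", 1)],
]
grouped = shuffle_group_by_key(input_partitions, 3)
print(grouped)
```

After the shuffle each key lives in exactly one partition, so its values can be combined there without looking at any other partition.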

Tonight: review.
When a Spark program runs in cluster mode, two kinds of processes are started: the Driver process and the Executor computing processes. Every process needs resources to run.

ResourceManager
NodeManager
AppMaster
Container
MapTask and ReduceTask

preview

Note: in PySpark local mode, wholeTextFiles has a bug that can leave a single process short of memory; it works normally in a cluster environment.
