Presto: SQL on Everything


SQL
Connector API

The client sends the SQL statement to the coordinator over HTTP
cipepser HTTP
Coordinator node
Processes the statement (parse, analyze, optimize) and distributes the plan to workers
Worker nodes
External Storage System





Data Layout Properties

Data Layout
The optimizer uses the physical data layout to reduce shuffles
physical data layout
join: nested loop join
stage
worker node
shuffle
stage
HTTP long polling
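Shuffles cost CPU, so the optimizer tries to minimize them
Hive and Spark: shuffle data goes through disk
shuffle files are read by the Reduce side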

4 shuffles
data layout properties reduce the number of shuffles

Data Layout, shuffle


pipelines
task
stage: executed as tasks
splits
task
hash join: 2 pipelines
join
LocalShuffle
pipeline 1, pipeline 2
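A minimal sketch of the two pipelines of a hash join, assuming the usual build/probe split: pipeline 1 builds an in-memory hash table and pipeline 2 streams the other side through it. The function names, row format, and sample data are hypothetical and only illustrate the idea; this is not Presto's implementation.

```python
from collections import defaultdict


def build_pipeline(build_rows, key):
    """Pipeline 1: consume the build side and return an in-memory hash table."""
    table = defaultdict(list)
    for row in build_rows:
        table[row[key]].append(row)
    return table


def probe_pipeline(probe_rows, key, table):
    """Pipeline 2: stream the probe side and emit one joined row per match."""
    for row in probe_rows:
        for match in table.get(row[key], []):
            yield {**match, **row}


# Hypothetical sample data.
customers = [{"cust_id": 10, "name": "a"}, {"cust_id": 30, "name": "c"}]
orders = [
    {"order_id": 1, "cust_id": 10},
    {"order_id": 2, "cust_id": 20},
    {"order_id": 3, "cust_id": 10},
]

# The hand-off of the finished hash table stands in for the local shuffle
# that links the two pipelines inside one task.
hash_table = build_pipeline(customers, "cust_id")
joined = list(probe_pipeline(orders, "cust_id", hash_table))
```

The probe pipeline cannot start emitting rows until the build pipeline has finished, which is why the join needs two pipelines rather than one.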



appendix

The coordinator distributes stages to workers
stage, worker
stage scheduling: 2 policies
all-at-once
minimizes wall clock time
suits latency-sensitive use cases
phased
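The difference between the two stage scheduling policies can be sketched as follows. This is a simplified illustration, assuming a hypothetical dependency map from each stage to the stages it waits on. Phased scheduling is approximated here by plain topological waves, which ignores the grouping of stages that must start together.

```python
from graphlib import TopologicalSorter


def all_at_once(deps):
    """Start every stage in a single wave."""
    return [sorted(deps)]


def phased(deps):
    """Start stages in waves; a stage starts only after its dependencies."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


# Hypothetical plan: the probe stage waits for the build stage,
# and the output stage waits for the probe stage.
deps = {"build": set(), "probe": {"build"}, "output": {"probe"}}

all_waves = all_at_once(deps)
phased_waves = phased(deps)
```

With all-at-once, all three stages start together. With phased, the probe stage is not started until the build stage is done, so fewer stages are running at any given moment.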
Task scheduling
The task scheduler examines the plan tree and classifies stages as leaf stages or intermediate stages
leaf
placement takes connector and network constraints into account
shared-nothing
most CPU time goes to data read from connectors (e.g. decompressing)
leaf stage tasks are scheduled on every worker node
shared-storage mode: network intensive
intermediate
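The leaf/intermediate classification can be sketched as below, assuming the plan tree is given as a hypothetical map from each stage to its child stages. A stage with no child stages reads from connectors and counts as a leaf; every other stage only consumes output of other stages.

```python
def classify_stages(children):
    """Split stages into leaf (no child stages) and intermediate stages."""
    leaf = sorted(s for s, kids in children.items() if not kids)
    intermediate = sorted(s for s, kids in children.items() if kids)
    return leaf, intermediate


# Hypothetical plan tree: stage0 is the root, stage2 and stage3 scan tables.
plan = {
    "stage0": ["stage1"],
    "stage1": ["stage2", "stage3"],
    "stage2": [],
    "stage3": [],
}

leaf, intermediate = classify_stages(plan)
```

Only the leaf list is subject to connector and network placement constraints; the intermediate stages can in principle be placed on any worker.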
Split Scheduling
split
redis: table info, key value, hosts list