I'm sorry, but I think we need to sort out the terms in Data E | L̶u̵m̶i̵n̷o̴u̶s̶m̶e̵n̵B̶l̵o̵g̵
I'm sorry, but I think we need to sort out the terms in Data Engineering - I'm starting to get confused myself. Let me give you some thinking on what I usually mean:
Pipeline - a sequence of stages
Stage/Job - includes several tasks
Task - a single atomic process to be executed (script, command, utility)
Workflow - automation technology of a business process
The difference between pipeline and workflow:
Pipeline a clearly described process in which tasks are executed sequentially, or processes executed in parallel.
Workflow is usually non-linear and has an often abstract human description, processes may not run in parallel. Workflow has branches and loops.
#big_data