Drag action structure
Action type
Natural language description of starting position
Natural language description of ending position
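As a minimal sketch of how the three fields of a drag action might be held together in code: the class name `DragAction`, the field names, the default value "drag", and the example descriptions are all assumptions made for illustration, since the source only names the fields and does not define a concrete schema.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class DragAction:
    """Hypothetical container for the fields of a drag action."""

    # Natural language description of starting position.
    start_description: str
    # Natural language description of ending position.
    end_description: str
    # Action type. The literal "drag" is an assumed value, not from the source.
    action_type: str = "drag"

    def __post_init__(self) -> None:
        # Reject empty or whitespace-only descriptions, since a drag
        # needs both a start and an end to be meaningful.
        if not self.start_description.strip() or not self.end_description.strip():
            raise ValueError("both position descriptions must be non-empty")


# Illustrative values only.
action = DragAction(
    start_description="the file icon labeled 'report.pdf'",
    end_description="the Trash icon in the dock",
)
print(asdict(action))
```

Keeping the positions as free-text descriptions, rather than coordinates, follows the field names given above; resolving a description to a screen location would be a separate step that is not specified here.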