I am totally confused about your question of parallel... by default, parallel worker processes is not 'on' in a default configuration. if parallel worker processes are available then the work is divided by a much equation.
Before using parallel mode, i'd be more interested in other join types.
N-ary is short-circuited nested loop.
You can avoid unneeded reads by having proper indexes and good statistics on the join columns.