Manipulating Spark data using both dplyr and SQL