Symptom
When a user runs a query command in the Spark-shell, the system displays a message indicating that the number of groups exceeds the upper limit. However, the same SQL statement can be queried in Spark-SQL and spark-beeline.
Solution
When a table is created, null characters are not converted to null characters. As a result, an array violation exception occurs when the spark-shell query field is empty. Add 'serialization.null.format' = 'or ROW FORMAT delimited fields terminated by' \t 'lines terminated by' \ n 'null defined as'' in the table creation statement to solve the problem.