A single column family receives 30,000 records per second, each with 200 columns. The data is written with mutate calls, and the client write buffer is 10 MB. Four test machines are available, with 128 GB of memory and 60 GB allocated to HBase. How can the write path be optimized?
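You can write the data in bulk load mode instead. Use a MapReduce job to generate HFiles, then import those HFiles into the table with the bulk load tool. This is very fast because the files are handed to the RegionServers directly, bypassing the normal write path (WAL and MemStore) that every mutate call goes through.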
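To get a feel for the load that the mutate path has to absorb, here is a rough sketch of the arithmetic using the figures from the question. The record rate and column count come from the question. The average cell size of 50 bytes is a purely hypothetical assumption, reading "10M" as 10 MiB is also an assumption, and the class and method names are made up for illustration.

```java
// Rough write-load arithmetic for the figures in the question.
// ASSUMPTION: bytesPerCell is a hypothetical average (row key, qualifier,
// value and per-cell overhead). Measure your own data before relying on it.
final class WriteLoadEstimate {

    // Total cells written per second: records per second times columns per record.
    static long cellsPerSecond(long recordsPerSecond, long columnsPerRecord) {
        return recordsPerSecond * columnsPerRecord;
    }

    // Approximate bytes written per second for a given average cell size.
    static long bytesPerSecond(long cellsPerSecond, long bytesPerCell) {
        return cellsPerSecond * bytesPerCell;
    }

    // How many times per second a client write buffer of this size fills up.
    static double bufferFillsPerSecond(long bytesPerSecond, long bufferBytes) {
        return (double) bytesPerSecond / bufferBytes;
    }

    public static void main(String[] args) {
        long cells = cellsPerSecond(30_000, 200);   // figures from the question
        long bytesPerCell = 50;                     // hypothetical average
        long bytes = bytesPerSecond(cells, bytesPerCell);
        long bufferBytes = 10L * 1024 * 1024;       // "10M" read as 10 MiB (assumption)

        System.out.println("cells per second:        " + cells);
        System.out.println("bytes per second:        " + bytes);
        System.out.println("buffer fills per second: "
                + bufferFillsPerSecond(bytes, bufferBytes));
    }
}
```

With 30,000 records of 200 columns each, the cluster has to take in 6,000,000 cells per second. Under the assumed 50 bytes per cell, a 10 MiB client buffer would fill somewhere between 28 and 29 times per second, which gives an idea of why generating HFiles offline and bulk loading them may be attractive at this rate.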