Optimization Options
Compute Bound
- Use
tree_method=histfor faster and just as accurate models - Use GPUs (
tree_method=gpu_hist) - Consider using XGBoost-Ray
Memory Bound
- Use
max_cache_hist_nodeto limit CPU cache size - Use
QuantizeDMatrixto reduce intermediate memory size (only useful if entire memory can fit on the machine)
Large Data Size
Use memory mapping via the DataIterator API, see Using XGBoost External Memory Version for more info
- Use
grow_policy=depthwiseto iterate over data as efficiently as possible