
[CUDA] CUDA Quantized Training (fixes #5606) #5933

Merged: 52 commits merged into master from cuda-quantized-training on Oct 8, 2023
Conversation

@shiyu1994 (Collaborator) commented on Jun 16, 2023

Fixes #5606.

Adds quantized training for CUDA version.
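For context, a minimal usage sketch (not part of this PR): enabling quantized training on the CUDA device from the Python package. The use_quantized_grad parameter is the switch checked in the diff further down; the synthetic data and the remaining parameter values are illustrative assumptions only.

import numpy as np
import lightgbm as lgb

# synthetic data, just for illustration
X = np.random.rand(10_000, 20)
y = np.random.rand(10_000)
train_set = lgb.Dataset(X, label=y)

params = {
    "objective": "regression",
    "device_type": "cuda",        # use the CUDA tree learner
    "use_quantized_grad": True,   # enable quantized (integer-gradient) training
    "verbose": -1,
}
booster = lgb.train(params, train_set, num_boost_round=100)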

fix msvc compilation errors and warnings
@jameslamb mentioned this pull request on Sep 8, 2023
@jameslamb changed the title from [CUDA] CUDA Quantized Training to [CUDA] CUDA Quantized Training (fixes #5606) on Sep 8, 2023
@shiyu1994 (Collaborator, Author)

@guolinke This is ready. Please check.

@@ -40,6 +40,9 @@ CUDABestSplitFinder::CUDABestSplitFinder(
select_features_by_node_(select_features_by_node),
cuda_hist_(cuda_hist) {
InitFeatureMetaInfo(train_data);
if (has_categorical_feature_ && config->use_quantized_grad) {
@shiyu1994 (Collaborator, Author) commented on the diff above:

Link #6119

@shiyu1994 (Collaborator, Author)

@jameslamb I've raised the size limit for the distributed package to 100 MB, because this PR adds a few more templates that increase the size of the compiled files. Do you think that is OK?

@jameslamb (Collaborator)

> I've raised the size limit for the distributed package to 100 MB, because this PR adds a few more templates that increase the size of the compiled files. Do you think that is OK?

Thanks for the @-mention.

For now, since we're not distributing these CUDA wheels on PyPI, I think it's ok. Let's not let it block this PR.

But if we pursue shipping a fat wheel in the future with CUDA support precompiled (like we talked about in Slack), 100MB will be a problem.

There are limits on PyPI for both individual file size and cumulative project size. I don't know the exact numbers, but I think shipping 100MB wheels would put us in range of hitting them.

See these discussions:

There are also other concerns with such large wheels, e.g. for people using function-as-a-service things like AWS Lambda. See for example:

I'll open a new issue in the next few days to discuss publishing wheels with CUDA support.

@jameslamb
Copy link
Collaborator

I removed the feature label from this and left efficiency. For release-drafter, I think a PR can only fall under one of the labels specified here, not multiple:

categories:
  - title: '💡 New Features'
    label: 'feature'
  - title: '🔨 Breaking'
    label: 'breaking'
  - title: '🚀 Efficiency Improvement'
    label: 'efficiency'
  - title: '🐛 Bug Fixes'
    label: 'fix'
  - title: '📖 Documentation'
    label: 'doc'
  - title: '🧰 Maintenance'
    label: 'maintenance'

@guolinke (Collaborator) left a comment:

Thank you!

@shiyu1994 merged commit f901f47 into master on Oct 8, 2023
41 checks passed
@shiyu1994 deleted the cuda-quantized-training branch on October 8, 2023 15:25
Ten0 pushed a commit to Ten0/LightGBM that referenced this pull request Jan 12, 2024
* add quantized training (first stage)

* add histogram construction functions for integer gradients

* add stochastic rounding

* update docs

* fix compilation errors by adding template instantiations

* update files for compilation

* fix compilation of gpu version

* initialize gradient discretizer before share states

* add a test case for quantized training

* add quantized training for data distributed training

* Delete origin.pred

* Delete ifelse.pred

* Delete LightGBM_model.txt

* remove useless changes

* fix lint error

* remove debug loggings

* fix mismatch of vector and allocator types

* remove changes in main.cpp

* fix bugs with uninitialized gradient discretizer

* initialize ordered gradients in gradient discretizer

* disable quantized training with gpu and cuda

fix msvc compilation errors and warnings

* fix bug in data parallel tree learner

* make quantized training test deterministic

* make quantized training in test case more accurate

* refactor test_quantized_training

* fix leaf splits initialization with quantized training

* check distributed quantized training result

* add cuda gradient discretizer

* add quantized training for CUDA version in tree learner

* remove cuda computability 6.1 and 6.2

* fix parts of gpu quantized training errors and warnings

* fix build-python.sh to install locally built version

* fix memory access bugs

* fix lint errors

* mark cuda quantized training on cuda with categorical features as unsupported

* rename cuda_utils.h to cuda_utils.hu

* enable quantized training with cuda

* fix cuda quantized training with sparse row data

* allow using global memory buffer in histogram construction with cuda quantized training

* recover build-python.sh

enlarge allowed package size to 100M
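The "gradient discretizer" and "stochastic rounding" commits above capture the core idea of quantized training: float gradients are mapped to a small integer range before histogram construction, with stochastic rounding keeping the quantization unbiased on average. The following NumPy sketch only illustrates that idea; the names, bin count, and scaling are assumptions, not code from this PR.

import numpy as np

def stochastic_round(x, rng):
    # Round each value down or up; the probability of rounding up equals the
    # fractional part, so the rounding is unbiased in expectation.
    low = np.floor(x)
    return (low + (rng.random(x.shape) < (x - low))).astype(np.int32)

def discretize_gradients(grad, num_bins=16, seed=0):
    # Map float gradients onto integers in roughly [-num_bins/2, num_bins/2].
    # The returned scale converts integer histogram sums back to float values.
    rng = np.random.default_rng(seed)
    scale = np.max(np.abs(grad)) / (num_bins / 2) + 1e-12
    return stochastic_round(grad / scale, rng), scale

grad = np.random.randn(8)
int_grad, scale = discretize_gradients(grad)
print(int_grad)          # small integers suitable for integer histogram bins
print(int_grad * scale)  # approximate reconstruction of the original gradients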
Development

Successfully merging this pull request may close these issues:

Add quantized training (#5606)
3 participants