2025-05-07T19:42:38.9725558Z Current runner version: '2.323.0' 2025-05-07T19:42:38.9731198Z Runner name: 'i-04b10210667d81210' 2025-05-07T19:42:38.9732166Z Machine name: 'ip-10-0-35-5' 2025-05-07T19:42:38.9734716Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:38.9737154Z Contents: read 2025-05-07T19:42:38.9737702Z Metadata: read 2025-05-07T19:42:38.9738242Z Packages: read 2025-05-07T19:42:38.9738835Z ##[endgroup] 2025-05-07T19:42:38.9741342Z Secret source: None 2025-05-07T19:42:38.9742440Z Prepare workflow directory 2025-05-07T19:42:39.0352303Z Prepare all required actions 2025-05-07T19:42:39.0391168Z Getting action download info 2025-05-07T19:42:39.2340864Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:39.4901690Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:40.0412681Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.13, 12.6.3, clang) 2025-05-07T19:42:40.1320115Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:40.1450908Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:40.1461735Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:40.1463377Z ##[endgroup] 2025-05-07T19:42:41.3148276Z Runner Type: linux.24xlarge 2025-05-07T19:42:41.3148744Z Instance Type: c5.24xlarge 2025-05-07T19:42:41.3149089Z AMI Name: unknown 2025-05-07T19:42:41.3184709Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:46.3458376Z ##[group]Checking docker version 2025-05-07T19:42:46.3472105Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:46.3678644Z '1.44' 2025-05-07T19:42:46.3695308Z Docker daemon API version: '1.44' 2025-05-07T19:42:46.3695842Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:46.3878718Z '1.44' 2025-05-07T19:42:46.3888951Z Docker client API 
version: '1.44' 2025-05-07T19:42:46.3893733Z ##[endgroup] 2025-05-07T19:42:46.3896297Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:46.3901010Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=2c91e9" 2025-05-07T19:42:46.4036608Z ##[command]/usr/bin/docker network prune --force --filter "label=2c91e9" 2025-05-07T19:42:46.4167507Z ##[endgroup] 2025-05-07T19:42:46.4167944Z ##[group]Create local container network 2025-05-07T19:42:46.4180884Z ##[command]/usr/bin/docker network create --label 2c91e9 github_network_9484ef32e44d40e598a92c2c5c95b912 2025-05-07T19:42:46.6899428Z b13ad66fb64a9cd7cf6d82d8dc4f956656c4bab684d34c908a68692ed9071e05 2025-05-07T19:42:46.6922296Z ##[endgroup] 2025-05-07T19:42:46.6946288Z ##[group]Starting job container 2025-05-07T19:42:46.6965355Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:46.8209461Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:46.8316083Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:46.8316992Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:46.8339889Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:46.8434762Z ##[command]/usr/bin/docker create --name 31897149cfcd45e99da7b09c84542214_amazonlinux2023_3f340f --label 2c91e9 --workdir /__w/FBGEMM/FBGEMM --network github_network_9484ef32e44d40e598a92c2c5c95b912 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" 
amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:46.8856353Z 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T19:42:46.8884586Z ##[command]/usr/bin/docker start 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T19:42:47.3253696Z 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T19:42:47.3272259Z ##[command]/usr/bin/docker ps --all --filter id=2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:47.3432222Z 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace Up Less than a second 2025-05-07T19:42:47.3450273Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T19:42:47.3596148Z HOME=/github/home 2025-05-07T19:42:47.3597541Z GITHUB_ACTIONS=true 2025-05-07T19:42:47.3598040Z CI=true 2025-05-07T19:42:47.3598515Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:47.3617465Z ##[endgroup] 2025-05-07T19:42:47.3628691Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:47.3630611Z ##[endgroup] 2025-05-07T19:42:47.3717953Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.3718916Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.3719850Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:47.3720282Z env: 2025-05-07T19:42:47.3720599Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:47.3721028Z BUILD_ENV: build_binary 2025-05-07T19:42:47.3721339Z BUILD_TARGET: default 2025-05-07T19:42:47.3721693Z BUILD_VARIANT: cuda 2025-05-07T19:42:47.3722000Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:47.3722353Z ##[endgroup] 2025-05-07T19:42:48.2140092Z Amazon Linux 2023 repository 67 MB/s | 37 MB 00:00 
2025-05-07T19:42:54.8085481Z Last metadata expiration check: 0:00:06 ago on Wed May 7 19:42:48 2025. 2025-05-07T19:42:55.3637107Z Dependencies resolved. 2025-05-07T19:42:55.3812830Z Nothing to do. 2025-05-07T19:42:55.3814169Z Complete! 2025-05-07T19:42:55.6127680Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:48 2025. 2025-05-07T19:42:55.6769399Z Dependencies resolved. 2025-05-07T19:42:55.6996199Z ======================================================================================== 2025-05-07T19:42:55.6998084Z Package Arch Version Repository Size 2025-05-07T19:42:55.7000127Z ======================================================================================== 2025-05-07T19:42:55.7001488Z Installing: 2025-05-07T19:42:55.7002806Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:55.7003916Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:55.7004475Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:55.7005129Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:55.7005776Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:55.7006325Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:55.7006887Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:55.7007380Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.7007877Z Installing dependencies: 2025-05-07T19:42:55.7008314Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:55.7008912Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:55.7009516Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.7010212Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:55.7011078Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:55.7011606Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 
2025-05-07T19:42:55.7012227Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:55.7012740Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:55.7013292Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:55.7013958Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:55.7014489Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:55.7015043Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:55.7015739Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:55.7016323Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:55.7017209Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:55.7017849Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.7018472Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:55.7019036Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:55.7019706Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.7020357Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:55.7021069Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:55.7021811Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.7022444Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:55.7022995Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:55.7023639Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:55.7024197Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:55.7024817Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:55.7025503Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:55.7026090Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:55.7026708Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:55.7154256Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.7154939Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:55.7155471Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:55.7156032Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.7156646Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.7157282Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.7157907Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.7158568Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.7159390Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:55.7159929Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.7160551Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.7161080Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.7161597Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:55.7162175Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:55.7162763Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.7163312Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.7164133Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.7164826Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.7165434Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.7166051Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.7166643Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.7167230Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 
2025-05-07T19:42:55.7167789Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:55.7168533Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:55.7169086Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.7169705Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.7170535Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:55.7171142Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:55.7171760Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:55.7172355Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:55.7172988Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:55.7173610Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:55.7174193Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.7174834Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:55.7175459Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.7176065Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.7176768Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:55.7177335Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.7177967Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:55.7178570Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:55.7179176Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.7179792Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:55.7180445Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:55.7181215Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 
amazonlinux 34 k 2025-05-07T19:42:55.7181784Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.7182355Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.7182920Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:55.7183511Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:55.7184110Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:55.7184664Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.7185217Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:55.7185754Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:55.7186375Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:55.7186937Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:55.7187556Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.7188173Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:55.7188779Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:55.7189449Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.7189994Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.7190525Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:55.7191206Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:55.7191708Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:55.7192244Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:55.7192685Z Installing weak dependencies: 2025-05-07T19:42:55.7193113Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:55.7193704Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.7194258Z perl-IO-Socket-SSL noarch 
2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:55.7194831Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:55.7195379Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.7195919Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:55.7196248Z 2025-05-07T19:42:55.7196374Z Transaction Summary 2025-05-07T19:42:55.7196647Z ======================================================================================== 2025-05-07T19:42:55.7196987Z Install 107 Packages 2025-05-07T19:42:55.7197136Z 2025-05-07T19:42:55.7197278Z Total download size: 38 M 2025-05-07T19:42:55.7197559Z Installed size: 151 M 2025-05-07T19:42:55.7197839Z Downloading Packages: 2025-05-07T19:42:56.0063186Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.9 MB/s | 82 kB 00:00 2025-05-07T19:42:56.0233405Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 20 MB/s | 786 kB 00:00 2025-05-07T19:42:56.0457042Z (3/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 87 MB/s | 5.3 MB 00:00 2025-05-07T19:42:56.0465205Z (4/107): elfutils-debuginfod-client-0.188-3.amz 1.2 MB/s | 41 kB 00:00 2025-05-07T19:42:56.0526425Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 
19 MB/s | 539 kB 00:00 2025-05-07T19:42:56.0545799Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 8.5 MB/s | 54 kB 00:00 2025-05-07T19:42:56.0744326Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 56 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.0900775Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 75 MB/s | 2.8 MB 00:00 2025-05-07T19:42:56.1004635Z (9/107): groff-base-1.22.4-7.amzn2023.0.2.x86_6 48 MB/s | 1.0 MB 00:00 2025-05-07T19:42:56.1198231Z (10/107): git-core-2.47.1-1.amzn2023.0.2.x86_64 66 MB/s | 4.7 MB 00:00 2025-05-07T19:42:56.1252643Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.2 MB/s | 160 kB 00:00 2025-05-07T19:42:56.1333458Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 50 MB/s | 1.6 MB 00:00 2025-05-07T19:42:56.1352278Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.5 MB/s | 46 kB 00:00 2025-05-07T19:42:56.1366428Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 5.4 MB/s | 62 kB 00:00 2025-05-07T19:42:56.1401934Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 26 MB/s | 168 kB 00:00 2025-05-07T19:42:56.1426991Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 9.2 MB/s | 57 kB 00:00 2025-05-07T19:42:56.1467467Z (17/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 
4.3 MB/s | 28 kB 00:00 2025-05-07T19:42:56.1520186Z (18/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 49 MB/s | 756 kB 00:00 2025-05-07T19:42:56.1548002Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 9.6 MB/s | 108 kB 00:00 2025-05-07T19:42:56.1571901Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 15 MB/s | 153 kB 00:00 2025-05-07T19:42:56.1597332Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 14 MB/s | 95 kB 00:00 2025-05-07T19:42:56.1643944Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 4.9 MB/s | 31 kB 00:00 2025-05-07T19:42:56.1673060Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 11 MB/s | 106 kB 00:00 2025-05-07T19:42:56.1696261Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 12 MB/s | 121 kB 00:00 2025-05-07T19:42:56.1714400Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.5 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1861818Z (26/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 26 MB/s | 394 kB 00:00 2025-05-07T19:42:56.1877739Z (27/107): nano-default-editor-8.3-1.amzn2023.no 574 kB/s | 10 kB 00:00 2025-05-07T19:42:56.1946749Z (28/107): nano-8.3-1.amzn2023.x86_64.rpm 28 MB/s | 706 kB 00:00 2025-05-07T19:42:56.1988608Z (29/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 23 MB/s | 256 kB 00:00 2025-05-07T19:42:56.2035089Z (30/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 34 MB/s | 573 kB 00:00 2025-05-07T19:42:56.2075940Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 38 MB/s | 454 kB 00:00 2025-05-07T19:42:56.2135461Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 50 MB/s | 708 kB 00:00 2025-05-07T19:42:56.2181087Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 38 MB/s | 542 kB 00:00 2025-05-07T19:42:56.2200456Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 
8.3 MB/s | 93 kB 00:00 2025-05-07T19:42:56.2225233Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.2 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2346816Z (36/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 2.5 MB/s | 29 kB 00:00 2025-05-07T19:42:56.2368935Z (37/107): perl-AutoLoader-5.74-477.amzn2023.0.6 1.3 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2399249Z (38/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 9.3 MB/s | 179 kB 00:00 2025-05-07T19:42:56.2418294Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 3.4 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2443689Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 7.9 MB/s | 55 kB 00:00 2025-05-07T19:42:56.2463312Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 4.2 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2489262Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.6 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2506618Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.3 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2691135Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 77 MB/s | 1.7 MB 00:00 2025-05-07T19:42:56.2698222Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 730 kB/s | 15 kB 00:00 2025-05-07T19:42:56.2722978Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.2 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2750074Z (47/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 4.6 MB/s | 21 kB 00:00 2025-05-07T19:42:56.2779330Z (48/107): perl-Exporter-5.74-459.amzn2023.0.2.n 4.2 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2793487Z (49/107): perl-File-Basename-2.85-477.amzn2023. 2.6 MB/s | 18 kB 00:00 2025-05-07T19:42:56.2815584Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 4.7 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2855525Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.4 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2867609Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 8.5 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2890235Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 
2.5 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2920073Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 3.5 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2941449Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 8.7 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2958341Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.4 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2977137Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.8 MB/s | 42 kB 00:00 2025-05-07T19:42:56.3000536Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 10 MB/s | 56 kB 00:00 2025-05-07T19:42:56.3020411Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 14 MB/s | 87 kB 00:00 2025-05-07T19:42:56.3043219Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 7.2 MB/s | 42 kB 00:00 2025-05-07T19:42:56.3075620Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 30 MB/s | 218 kB 00:00 2025-05-07T19:42:56.3098020Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.2 MB/s | 23 kB 00:00 2025-05-07T19:42:56.3116720Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.6 MB/s | 31 kB 00:00 2025-05-07T19:42:56.3136133Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.4 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3154872Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 4.3 MB/s | 23 kB 00:00 2025-05-07T19:42:56.3202095Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 48 MB/s | 392 kB 00:00 2025-05-07T19:42:56.3226178Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 11 MB/s | 97 kB 00:00 2025-05-07T19:42:56.3254031Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 9.6 MB/s | 85 kB 00:00 2025-05-07T19:42:56.3272643Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 
2.8 MB/s | 20 kB 00:00 2025-05-07T19:42:56.3299591Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 13 MB/s | 84 kB 00:00 2025-05-07T19:42:56.3340323Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 26 MB/s | 215 kB 00:00 2025-05-07T19:42:56.3360172Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 5.1 MB/s | 41 kB 00:00 2025-05-07T19:42:56.3384474Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 9.7 MB/s | 71 kB 00:00 2025-05-07T19:42:56.3421319Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.2 MB/s | 12 kB 00:00 2025-05-07T19:42:56.3448100Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 7.0 MB/s | 55 kB 00:00 2025-05-07T19:42:56.3468698Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 12 MB/s | 96 kB 00:00 2025-05-07T19:42:56.3493247Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.3 MB/s | 15 kB 00:00 2025-05-07T19:42:56.3516164Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 7.7 MB/s | 48 kB 00:00 2025-05-07T19:42:56.3536254Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 3.9 MB/s | 22 kB 00:00 2025-05-07T19:42:56.3548420Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 7.0 MB/s | 36 kB 00:00 2025-05-07T19:42:56.3571272Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 3.3 MB/s | 17 kB 00:00 2025-05-07T19:42:56.3595598Z (82/107): perl-Time-Local-1.300-5.amzn2023.0.2. 8.0 MB/s | 34 kB 00:00 2025-05-07T19:42:56.3613753Z (83/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.0 MB/s | 22 kB 00:00 2025-05-07T19:42:56.3691767Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 9.1 MB/s | 108 kB 00:00 2025-05-07T19:42:56.3708417Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 1.5 MB/s | 17 kB 00:00 2025-05-07T19:42:56.3725501Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 2.1 MB/s | 23 kB 00:00 2025-05-07T19:42:56.3746570Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.8 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3781563Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
14 MB/s | 71 kB 00:00 2025-05-07T19:42:56.3804534Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.2 MB/s | 15 kB 00:00 2025-05-07T19:42:56.3831579Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 16 MB/s | 126 kB 00:00 2025-05-07T19:42:56.3973412Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 111 MB/s | 2.0 MB 00:00 2025-05-07T19:42:56.3986399Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.6 MB/s | 29 kB 00:00 2025-05-07T19:42:56.4004220Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.7 MB/s | 46 kB 00:00 2025-05-07T19:42:56.4024945Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.7 MB/s | 13 kB 00:00 2025-05-07T19:42:56.4059561Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 3.0 MB/s | 14 kB 00:00 2025-05-07T19:42:56.4083589Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 15 MB/s | 112 kB 00:00 2025-05-07T19:42:56.4098193Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.7 MB/s | 12 kB 00:00 2025-05-07T19:42:56.4117162Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.6 MB/s | 13 kB 00:00 2025-05-07T19:42:56.4220363Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 93 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.4309640Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 62 MB/s | 1.3 MB 00:00 2025-05-07T19:42:56.4323829Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 2.7 MB/s | 56 kB 00:00 2025-05-07T19:42:56.4377819Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 44 MB/s | 613 kB 00:00 2025-05-07T19:42:56.4444957Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 84 MB/s | 879 kB 00:00 2025-05-07T19:42:56.4528441Z (104/107): util-linux-core-2.37.4-1.amzn2023.0. 
32 MB/s | 432 kB 00:00 2025-05-07T19:42:56.4634323Z (105/107): util-linux-2.37.4-1.amzn2023.0.4.x86 74 MB/s | 2.2 MB 00:00 2025-05-07T19:42:56.4692658Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 37 MB/s | 779 kB 00:00 2025-05-07T19:42:56.4704748Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 2.4 MB/s | 42 kB 00:00 2025-05-07T19:42:56.4728264Z -------------------------------------------------------------------------------- 2025-05-07T19:42:56.4728801Z Total 49 MB/s | 38 MB 00:00 2025-05-07T19:42:57.5358787Z Running transaction check 2025-05-07T19:42:57.5829038Z Transaction check succeeded. 2025-05-07T19:42:57.5829925Z Running transaction test 2025-05-07T19:42:57.9535609Z Transaction test succeeded. 2025-05-07T19:42:57.9536657Z Running transaction 2025-05-07T19:42:59.0640784Z Preparing : 1/1 2025-05-07T19:42:59.0806087Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:59.1060028Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:59.1285252Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:59.1365843Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:59.1444351Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:59.1545455Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:59.1843599Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:59.1930612Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:59.1998808Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:59.2518809Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:59.2610423Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:59.3070426Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:59.3139507Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:59.3210108Z Installing 
: libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:59.3282308Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:59.3344720Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:59.3496308Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:59.3555232Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:59.3619948Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:59.3704430Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:59.3773020Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:59.3834639Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:59.4268651Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:59.4358958Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:59.4523342Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:59.4973798Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:59.5167628Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:59.5998700Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:59.6000384Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.6001771Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:59.6002523Z 2025-05-07T19:42:59.6223758Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.6577810Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.6776956Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.6852101Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.7975129Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.9491169Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 
2025-05-07T19:42:59.9624956Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:43:00.0060264Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:43:00.0148886Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:43:00.0234255Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:43:00.0304389Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:43:00.0398553Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:00.0455909Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:43:00.0504865Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:00.0561997Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:43:00.0649362Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:43:00.0717089Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:43:00.0816062Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:43:00.1026297Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:00.1121050Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:43:00.1174331Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:43:00.1223268Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:43:00.1279435Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:43:00.1340842Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:00.1405346Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:00.1497920Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:43:00.1562158Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:43:00.1609547Z Installing : 
perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:43:00.1671143Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:43:00.1731850Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:43:00.1796211Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:43:00.1844857Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:00.1904460Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:43:00.1980214Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:43:00.2042525Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:43:00.2146953Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:43:00.2235256Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:43:00.2290630Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:43:00.2339448Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:43:00.2388505Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:00.2464726Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:43:00.2565858Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:00.2644372Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:43:00.2699492Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:43:00.2761004Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:43:00.2830908Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:43:00.2893593Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:43:00.2950715Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:43:00.3024190Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 
71/107 2025-05-07T19:43:00.3075032Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:43:00.3126993Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:43:00.3186114Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:00.3264059Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:00.3343610Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:43:00.3413038Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:43:00.3471054Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:43:00.3527819Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:43:00.3575198Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:43:00.3642223Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:43:00.3699860Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:43:00.3756074Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:43:00.3810542Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:00.3870945Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:43:00.3950750Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:43:00.4489746Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:43:00.5451824Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:43:00.5579962Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:00.5658999Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:43:00.5738030Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:43:00.5802878Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:43:00.5868872Z Installing 
: perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:43:00.5925502Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:43:00.5988488Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:00.6064024Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:43:00.6260850Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:43:00.6383846Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:43:00.6470846Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:43:00.6875144Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:43:00.8100812Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:43:00.8191194Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.8316413Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.8615442Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:43:00.8718163Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:00.8968911Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:43:00.9181061Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:00.9267382Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.9395116Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:43:01.7059190Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.7059953Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:43:01.7060766Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:43:01.7061355Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:43:01.7062031Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:43:01.7062655Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:43:01.7063292Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:43:01.7063914Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:43:01.7064507Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:43:01.7065494Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:43:01.7066081Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:43:01.7066728Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:43:01.7067373Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:43:01.7067967Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:43:01.7068613Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:43:01.7069181Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:43:01.7069805Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:43:01.7070672Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:43:01.7071251Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:43:01.7071892Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:43:01.7072473Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:43:01.7073118Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:43:01.7073759Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:43:01.7074353Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:43:01.7075008Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:43:01.7075603Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:43:01.7076257Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:43:01.7076922Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:43:01.7077529Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:43:01.7078174Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:43:01.7078767Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:43:01.7079402Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:43:01.7080043Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:43:01.7080649Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:43:01.7081282Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:01.7081868Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:43:01.7082549Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:01.7083343Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:43:01.7083920Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:43:01.7084601Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:43:01.7085204Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:43:01.7085849Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:01.7086468Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:43:01.7087112Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:43:01.7087749Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:43:01.7088349Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:43:01.7089078Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:01.7089655Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:01.7090351Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:43:01.7090994Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:43:01.7091583Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:43:01.7092276Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:43:01.7092940Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:43:01.7093538Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:43:01.7094094Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:01.7094683Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:43:01.7095244Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:43:01.7095819Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:43:01.7096399Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:43:01.7097065Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:43:01.7097657Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:43:01.7098218Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:43:01.7098814Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:01.7099365Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:43:01.7099961Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:01.7100560Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:43:01.7101107Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:43:01.7101685Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:43:01.7102230Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:43:01.7102819Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:43:01.7103411Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:43:01.7103970Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:43:01.7104547Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:43:01.7105098Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:43:01.7105798Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:01.7106350Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:01.7106919Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:43:01.7107502Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:43:01.7108050Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:43:01.7108632Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:43:01.7109177Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:43:01.7109765Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:43:01.7110325Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:43:01.7110965Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:43:01.7111599Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:01.7112125Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:43:01.7112686Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:43:01.7113221Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:43:01.7113787Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:43:01.7114355Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:01.7114884Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:43:01.7115443Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:43:01.7115988Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:43:01.7116567Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:43:01.7117155Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:43:01.7117705Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:01.7118275Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:43:01.7118811Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:43:01.7119378Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:43:01.7119911Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:43:01.7120452Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:43:01.7121019Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:43:01.7121585Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:43:01.7122132Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:43:01.7122640Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:01.7123204Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:43:01.7123728Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:01.8212617Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.8213712Z 2025-05-07T19:43:01.8213994Z Installed: 2025-05-07T19:43:01.8214938Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:43:01.8216541Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8217103Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:43:01.8218168Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8218762Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8219268Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8219785Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8220307Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.8220844Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:43:01.8221376Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8221878Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.8222397Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.8223190Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:43:01.8223690Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:43:01.8224155Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8224644Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8225134Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8225604Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:43:01.8226124Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8226620Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8227170Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8227684Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8228208Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8228709Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8229225Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8229689Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:43:01.8230195Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:43:01.8230716Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8231218Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.8231712Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:43:01.8232196Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.8232741Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.8233231Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8233726Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8234237Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8234761Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8235300Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8235803Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8236367Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8236914Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8237583Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.8238154Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8238697Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8239260Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8239883Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8240510Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.8241065Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.8241601Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8242147Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8243768Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8244318Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8244840Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8245379Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8245930Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8246458Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8247004Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8247516Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.8248050Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.8248550Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8249085Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:43:01.8249633Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.8250164Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8250712Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8251253Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:43:01.8251798Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8252327Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.8252845Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8253378Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8253899Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8254441Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:43:01.8254953Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8255471Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8255990Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8256618Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8257356Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8257953Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8258538Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8259250Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.8259832Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8260437Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8261030Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8261674Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:43:01.8262263Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.8262838Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.8263509Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8264035Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.8264582Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8265175Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8265715Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8266235Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.8266773Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8267295Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.8267814Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8268386Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8268930Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8269485Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.8270045Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8270960Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.8271549Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8272091Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.8272677Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.8273249Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:43:01.8273815Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8274358Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8274905Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8275498Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.8276005Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:43:01.8276350Z 2025-05-07T19:43:01.8276452Z Complete! 
2025-05-07T19:43:01.9057976Z ##[group]Run actions/checkout@v4 2025-05-07T19:43:01.9058350Z with: 2025-05-07T19:43:01.9058614Z submodules: true 2025-05-07T19:43:01.9058878Z repository: pytorch/FBGEMM 2025-05-07T19:43:01.9059433Z token: *** 2025-05-07T19:43:01.9059665Z ssh-strict: true 2025-05-07T19:43:01.9059943Z ssh-user: git 2025-05-07T19:43:01.9060192Z persist-credentials: true 2025-05-07T19:43:01.9060499Z clean: true 2025-05-07T19:43:01.9060782Z sparse-checkout-cone-mode: true 2025-05-07T19:43:01.9061087Z fetch-depth: 1 2025-05-07T19:43:01.9061349Z fetch-tags: false 2025-05-07T19:43:01.9061590Z show-progress: true 2025-05-07T19:43:01.9061859Z lfs: false 2025-05-07T19:43:01.9062101Z set-safe-directory: true 2025-05-07T19:43:01.9062630Z env: 2025-05-07T19:43:01.9062871Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.9063233Z BUILD_ENV: build_binary 2025-05-07T19:43:01.9063502Z BUILD_TARGET: default 2025-05-07T19:43:01.9063782Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.9064112Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.9064381Z ##[endgroup] 2025-05-07T19:43:01.9109322Z ##[command]/usr/bin/docker exec 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:02.2843531Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:02.2845284Z ##[group]Getting Git version info 2025-05-07T19:43:02.2845650Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:02.2846203Z [command]/usr/bin/git version 2025-05-07T19:43:02.2846481Z git version 2.47.1 2025-05-07T19:43:02.2847470Z ##[endgroup] 2025-05-07T19:43:02.2851543Z Temporarily overriding HOME='/__w/_temp/46b1aa03-60df-4cf8-a664-b9bfbcadb722' before making global git config changes 2025-05-07T19:43:02.2852340Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:02.2855899Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:02.2888310Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:43:02.2904770Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:02.2919947Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:02.2923201Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:02.2940739Z HEAD 2025-05-07T19:43:02.2975340Z ##[endgroup] 2025-05-07T19:43:02.2976084Z [command]/usr/bin/git submodule status 2025-05-07T19:43:02.3410779Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:02.3473993Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:43:02.3584251Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:02.3654575Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:43:02.3872582Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:43:02.3945709Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:43:02.3978278Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:43:02.3994831Z ##[group]Cleaning the repository 2025-05-07T19:43:02.3995490Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:02.7075873Z Removing build_only/ 2025-05-07T19:43:02.7076208Z Removing collect_env.py 2025-05-07T19:43:02.7076525Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:02.7076870Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:02.7077219Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:02.7077542Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:02.7077917Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:02.7081005Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:02.8142367Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into 
fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.8147695Z ##[endgroup] 2025-05-07T19:43:02.8149493Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:02.8153250Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:02.8181512Z ##[endgroup] 2025-05-07T19:43:02.8182606Z ##[group]Setting up auth 2025-05-07T19:43:02.8184338Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:02.8208863Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:02.8494048Z Entering 'external/asmjit' 2025-05-07T19:43:02.8540048Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.8598630Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.8647782Z Entering 'external/cutlass' 2025-05-07T19:43:02.8700637Z Entering 'external/googletest' 2025-05-07T19:43:02.8747924Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.8797344Z Entering 'external/json' 2025-05-07T19:43:02.8857593Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:02.8881605Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:02.9162252Z Entering 'external/asmjit' 2025-05-07T19:43:02.9212162Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.9263377Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.9319672Z Entering 'external/cutlass' 2025-05-07T19:43:02.9373952Z Entering 'external/googletest' 2025-05-07T19:43:02.9420659Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.9469175Z Entering 'external/json' 2025-05-07T19:43:02.9530912Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 
2025-05-07T19:43:02.9572009Z ##[endgroup] 2025-05-07T19:43:02.9573160Z ##[group]Fetching the repository 2025-05-07T19:43:02.9575796Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:03.2220134Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:03.2220766Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:03.2244097Z ##[endgroup] 2025-05-07T19:43:03.2244570Z ##[group]Determining the checkout info 2025-05-07T19:43:03.2245504Z ##[endgroup] 2025-05-07T19:43:03.2250039Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:03.2813142Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:03.2815001Z ##[group]Checking out the ref 2025-05-07T19:43:03.2815519Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:03.2896135Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:03.2896722Z any of your branches: 2025-05-07T19:43:03.2897638Z 2025-05-07T19:43:03.2899106Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:03.2900543Z 2025-05-07T19:43:03.2901201Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:03.2902320Z to do so with: 2025-05-07T19:43:03.2902463Z 2025-05-07T19:43:03.2902648Z git branch 1c9ad64 2025-05-07T19:43:03.2902869Z 2025-05-07T19:43:03.2903440Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:03.2904855Z ##[endgroup] 2025-05-07T19:43:03.2905339Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:03.2905999Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 
2025-05-07T19:43:03.2957049Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:03.2976392Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:03.3002988Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:03.3029471Z ##[endgroup] 2025-05-07T19:43:03.3030584Z ##[group]Fetching submodules 2025-05-07T19:43:03.3031478Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:03.3346455Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:03.3347878Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:03.3349148Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:03.3350317Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:03.3351516Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:03.3353215Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:03.3354560Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:03.3355951Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:03.4117490Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:03.6838480Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:03.7859950Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:04.4482598Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:04.4918443Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:04.5018434Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:04.6230817Z Submodule path 
'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:04.6242688Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:04.6534301Z Entering 'external/asmjit' 2025-05-07T19:43:04.6558308Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.6596422Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.6621732Z Entering 'external/cutlass' 2025-05-07T19:43:04.6652496Z Entering 'external/googletest' 2025-05-07T19:43:04.6685819Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.6716275Z Entering 'external/json' 2025-05-07T19:43:04.6762811Z ##[endgroup] 2025-05-07T19:43:04.6764110Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:04.6766898Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:04.7042132Z Entering 'external/asmjit' 2025-05-07T19:43:04.7072817Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7073204Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7116347Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.7149345Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7149797Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7201836Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.7246243Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7246880Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7284017Z Entering 'external/cutlass' 2025-05-07T19:43:04.7317877Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7318312Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7367460Z Entering 'external/googletest' 2025-05-07T19:43:04.7412377Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7413345Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7452569Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.7496856Z 
url.https://github.com/.insteadof 2025-05-07T19:43:04.7497873Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7536138Z Entering 'external/json' 2025-05-07T19:43:04.7569352Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7569803Z url.https://github.com/.insteadof 2025-05-07T19:43:04.7621601Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:04.7894091Z Entering 'external/asmjit' 2025-05-07T19:43:04.7938632Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:04.7939136Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.8000285Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:04.8001418Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.8054900Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:04.8058047Z Entering 'external/cutlass' 2025-05-07T19:43:04.8107456Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:04.8112758Z Entering 'external/googletest' 2025-05-07T19:43:04.8169696Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:04.8172444Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.8225529Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:04.8230179Z Entering 'external/json' 2025-05-07T19:43:04.8281711Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:04.8368924Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:04.8678192Z Entering 'external/asmjit' 2025-05-07T19:43:04.8709774Z Entering 'external/composable_kernel' 
2025-05-07T19:43:04.8737894Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.8767819Z Entering 'external/cutlass' 2025-05-07T19:43:04.8796141Z Entering 'external/googletest' 2025-05-07T19:43:04.8821392Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.8852308Z Entering 'external/json' 2025-05-07T19:43:04.8897369Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:04.9185926Z Entering 'external/asmjit' 2025-05-07T19:43:04.9214590Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.9249843Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.9273547Z Entering 'external/cutlass' 2025-05-07T19:43:04.9307114Z Entering 'external/googletest' 2025-05-07T19:43:04.9338168Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.9369160Z Entering 'external/json' 2025-05-07T19:43:04.9415832Z ##[endgroup] 2025-05-07T19:43:04.9441455Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:04.9461265Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.9616684Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:04.9617302Z . 
$PRELUDE; print_system_info 2025-05-07T19:43:04.9617927Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.9618299Z env: 2025-05-07T19:43:04.9618550Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.9618893Z BUILD_ENV: build_binary 2025-05-07T19:43:04.9619154Z BUILD_TARGET: default 2025-05-07T19:43:04.9619427Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.9619680Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.9619972Z ##[endgroup] 2025-05-07T19:43:05.4387273Z ################################################################################ 2025-05-07T19:43:05.4388378Z # Print System Info 2025-05-07T19:43:05.4389020Z # 2025-05-07T19:43:05.4411890Z # [2025-05-07T19:43:05.440Z] + print_system_info 2025-05-07T19:43:05.4412503Z ################################################################################ 2025-05-07T19:43:05.4412894Z 2025-05-07T19:43:05.4413183Z ################################################################################ 2025-05-07T19:43:05.4413689Z [INFO] Printing environment variables ... 
2025-05-07T19:43:05.4414055Z + printenv 2025-05-07T19:43:05.4414192Z 2025-05-07T19:43:05.4428365Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:05.4428792Z BUILD_VARIANT=cuda 2025-05-07T19:43:05.4430200Z HOSTNAME=2b31f69c500b 2025-05-07T19:43:05.4431528Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_8243a61a-e035-4651-9e08-aec1d4f28a3d 2025-05-07T19:43:05.4432875Z GITHUB_ACTION=__run_2 2025-05-07T19:43:05.4433551Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:05.4434237Z RUNNER_NAME=i-04b10210667d81210 2025-05-07T19:43:05.4435037Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:05.4435649Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:05.4435909Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:05.4436136Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:05.4436410Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:05.4436689Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:05.4437149Z *** 2025-05-07T19:43:05.4437350Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:05.4437850Z GITHUB_ACTIONS=true 2025-05-07T19:43:05.4438110Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:05.4438626Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:05.4439125Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:05.4439372Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:05.4439621Z RUNNER_OS=Linux 2025-05-07T19:43:05.4439824Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:05.4440062Z HOME=/github/home 2025-05-07T19:43:05.4440305Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:05.4440569Z RUNNER_ARCH=X64 2025-05-07T19:43:05.4440779Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:05.4440992Z BUILD_TARGET=default 2025-05-07T19:43:05.4441387Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_8243a61a-e035-4651-9e08-aec1d4f28a3d 2025-05-07T19:43:05.4441974Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_8243a61a-e035-4651-9e08-aec1d4f28a3d 2025-05-07T19:43:05.4442437Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:05.4442792Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:05.4443036Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:05.4443472Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_8243a61a-e035-4651-9e08-aec1d4f28a3d 2025-05-07T19:43:05.4443935Z BUILD_ENV=build_binary 2025-05-07T19:43:05.4444324Z GITHUB_ACTOR=q10 2025-05-07T19:43:05.4444595Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:05.4444828Z KERN_NAME_LC=linux 2025-05-07T19:43:05.4445045Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:05.4445510Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:05.4445865Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:05.4446136Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:05.4446422Z SHLVL=1 2025-05-07T19:43:05.4446614Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:05.4446870Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:05.4447378Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:05.4447774Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:05.4448023Z KERN_NAME=Linux 2025-05-07T19:43:05.4448257Z GITHUB_JOB=build_artifact 2025-05-07T19:43:05.4448516Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:05.4448806Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:05.4449067Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:05.4449323Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:05.4449677Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:05.4450049Z GITHUB_BASE_REF=main 2025-05-07T19:43:05.4450275Z CI=true 2025-05-07T19:43:05.4450479Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:05.4450770Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:05.4451045Z GITHUB_ACTION_REF= 2025-05-07T19:43:05.4451299Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:05.4451778Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_8243a61a-e035-4651-9e08-aec1d4f28a3d 2025-05-07T19:43:05.4452256Z MACHINE_NAME=x86_64 2025-05-07T19:43:05.4452594Z 
_=/usr/bin/printenv 2025-05-07T19:43:05.4452724Z 2025-05-07T19:43:05.4452844Z ################################################################################ 2025-05-07T19:43:05.4453163Z [INFO] Print ldd version ... 2025-05-07T19:43:05.4453410Z + ldd --version 2025-05-07T19:43:05.4453548Z 2025-05-07T19:43:05.4453644Z ldd (GNU libc) 2.34 2025-05-07T19:43:05.4453901Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:05.4454349Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:05.4454884Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:05.4455327Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:05.4455545Z 2025-05-07T19:43:05.4455671Z ################################################################################ 2025-05-07T19:43:05.4455972Z [INFO] Print CPU info ... 2025-05-07T19:43:05.4456212Z + nproc 2025-05-07T19:43:05.4456317Z 2025-05-07T19:43:05.4459953Z 96 2025-05-07T19:43:05.4461017Z 2025-05-07T19:43:05.4461127Z + lscpu 2025-05-07T19:43:05.4461315Z 2025-05-07T19:43:05.4726704Z Architecture: x86_64 2025-05-07T19:43:05.4727240Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:05.4727654Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4728060Z Byte Order: Little Endian 2025-05-07T19:43:05.4728487Z CPU(s): 96 2025-05-07T19:43:05.4728779Z On-line CPU(s) list: 0-95 2025-05-07T19:43:05.4729075Z Vendor ID: GenuineIntel 2025-05-07T19:43:05.4729450Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4729798Z CPU family: 6 2025-05-07T19:43:05.4730075Z Model: 85 2025-05-07T19:43:05.4730344Z Thread(s) per core: 2 2025-05-07T19:43:05.4730639Z Core(s) per socket: 24 2025-05-07T19:43:05.4730903Z Socket(s): 2 2025-05-07T19:43:05.4731179Z Stepping: 7 2025-05-07T19:43:05.4731469Z BogoMIPS: 5999.98 2025-05-07T19:43:05.4733590Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.4735731Z Hypervisor vendor: KVM 2025-05-07T19:43:05.4736249Z Virtualization type: full 2025-05-07T19:43:05.4736697Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:05.4737238Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:05.4737618Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:05.4738001Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:05.4738354Z NUMA node(s): 2 2025-05-07T19:43:05.4738672Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:05.4739020Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:05.4739482Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:05.4740058Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:05.4740554Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:05.4741173Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:05.4741764Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:05.4742611Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:05.4743239Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:05.4743612Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:05.4743998Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:05.4744376Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:05.4744921Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:05.4745756Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:05.4746400Z Vulnerability Srbds: Not affected 2025-05-07T19:43:05.4746782Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:05.4747022Z 2025-05-07T19:43:05.4747136Z + cat /proc/cpuinfo 2025-05-07T19:43:05.4747380Z 2025-05-07T19:43:05.4747467Z processor : 0 2025-05-07T19:43:05.4747711Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4747953Z cpu family : 6 2025-05-07T19:43:05.4748189Z model : 85 2025-05-07T19:43:05.4748488Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4748835Z stepping : 7 2025-05-07T19:43:05.4749062Z microcode : 0x5003901 2025-05-07T19:43:05.4749288Z cpu MHz : 3363.947 2025-05-07T19:43:05.4749516Z cache size : 36608 KB 2025-05-07T19:43:05.4749788Z physical id : 0 2025-05-07T19:43:05.4749999Z siblings : 48 2025-05-07T19:43:05.4750212Z core id : 0 2025-05-07T19:43:05.4750418Z cpu cores : 24 2025-05-07T19:43:05.4750632Z apicid : 0 2025-05-07T19:43:05.4750840Z initial apicid : 0 2025-05-07T19:43:05.4751061Z fpu : yes 2025-05-07T19:43:05.4751258Z fpu_exception : yes 2025-05-07T19:43:05.4751498Z cpuid level : 13 2025-05-07T19:43:05.4751704Z wp : yes 2025-05-07T19:43:05.4753932Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4756518Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4757097Z bogomips : 5999.98 2025-05-07T19:43:05.4757334Z clflush size : 64 2025-05-07T19:43:05.4757562Z cache_alignment : 64 2025-05-07T19:43:05.4757835Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4758227Z power management: 2025-05-07T19:43:05.4758369Z 2025-05-07T19:43:05.4758456Z processor : 1 2025-05-07T19:43:05.4758678Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4758920Z cpu family : 6 2025-05-07T19:43:05.4759132Z model : 85 2025-05-07T19:43:05.4759414Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4759773Z stepping : 7 2025-05-07T19:43:05.4759977Z microcode : 0x5003901 2025-05-07T19:43:05.4760220Z cpu MHz : 3291.276 2025-05-07T19:43:05.4760449Z cache size : 36608 KB 2025-05-07T19:43:05.4760679Z physical id : 0 2025-05-07T19:43:05.4760901Z siblings : 48 2025-05-07T19:43:05.4761112Z core id : 1 2025-05-07T19:43:05.4761326Z cpu cores : 24 2025-05-07T19:43:05.4761532Z apicid : 2 2025-05-07T19:43:05.4761750Z initial apicid : 2 2025-05-07T19:43:05.4761958Z fpu : yes 2025-05-07T19:43:05.4762165Z fpu_exception : yes 2025-05-07T19:43:05.4762387Z cpuid level : 13 2025-05-07T19:43:05.4762604Z wp : yes 2025-05-07T19:43:05.4764819Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4767402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4767969Z bogomips : 5999.98 2025-05-07T19:43:05.4768200Z clflush size : 64 2025-05-07T19:43:05.4768420Z cache_alignment : 64 2025-05-07T19:43:05.4768700Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4769020Z power management: 2025-05-07T19:43:05.4769168Z 2025-05-07T19:43:05.4769253Z processor : 2 2025-05-07T19:43:05.4769462Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4769757Z cpu family : 6 2025-05-07T19:43:05.4769956Z model : 85 2025-05-07T19:43:05.4770398Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4770759Z stepping : 7 2025-05-07T19:43:05.4770964Z microcode : 0x5003901 2025-05-07T19:43:05.4771299Z cpu MHz : 3302.616 2025-05-07T19:43:05.4771527Z cache size : 36608 KB 2025-05-07T19:43:05.4771762Z physical id : 0 2025-05-07T19:43:05.4771966Z siblings : 48 2025-05-07T19:43:05.4772173Z core id : 2 2025-05-07T19:43:05.4772366Z cpu cores : 24 2025-05-07T19:43:05.4772576Z apicid : 4 2025-05-07T19:43:05.4772768Z initial apicid : 4 2025-05-07T19:43:05.4772987Z fpu : yes 2025-05-07T19:43:05.4773181Z fpu_exception : yes 2025-05-07T19:43:05.4773408Z cpuid level : 13 2025-05-07T19:43:05.4773611Z wp : yes 2025-05-07T19:43:05.4775811Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4778459Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4779038Z bogomips : 5999.98 2025-05-07T19:43:05.4779255Z clflush size : 64 2025-05-07T19:43:05.4779487Z cache_alignment : 64 2025-05-07T19:43:05.4779757Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4780090Z power management: 2025-05-07T19:43:05.4780225Z 2025-05-07T19:43:05.4780308Z processor : 3 2025-05-07T19:43:05.4780658Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4780893Z cpu family : 6 2025-05-07T19:43:05.4781112Z model : 85 2025-05-07T19:43:05.4781382Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4781740Z stepping : 7 2025-05-07T19:43:05.4781957Z microcode : 0x5003901 2025-05-07T19:43:05.4782180Z cpu MHz : 3280.414 2025-05-07T19:43:05.4782413Z cache size : 36608 KB 2025-05-07T19:43:05.4782638Z physical id : 0 2025-05-07T19:43:05.4782857Z siblings : 48 2025-05-07T19:43:05.4783056Z core id : 3 2025-05-07T19:43:05.4783280Z cpu cores : 24 2025-05-07T19:43:05.4783481Z apicid : 6 2025-05-07T19:43:05.4783693Z initial apicid : 6 2025-05-07T19:43:05.4783904Z fpu : yes 2025-05-07T19:43:05.4784119Z fpu_exception : yes 2025-05-07T19:43:05.4784340Z cpuid level : 13 2025-05-07T19:43:05.4784568Z wp : yes 2025-05-07T19:43:05.4786776Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4789383Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4789919Z bogomips : 5999.98 2025-05-07T19:43:05.4790143Z clflush size : 64 2025-05-07T19:43:05.4790350Z cache_alignment : 64 2025-05-07T19:43:05.4790624Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4790928Z power management: 2025-05-07T19:43:05.4791071Z 2025-05-07T19:43:05.4791156Z processor : 4 2025-05-07T19:43:05.4791360Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4791605Z cpu family : 6 2025-05-07T19:43:05.4791797Z model : 85 2025-05-07T19:43:05.4792131Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4792468Z stepping : 7 2025-05-07T19:43:05.4792664Z microcode : 0x5003901 2025-05-07T19:43:05.4792892Z cpu MHz : 3325.774 2025-05-07T19:43:05.4793095Z cache size : 36608 KB 2025-05-07T19:43:05.4793319Z physical id : 0 2025-05-07T19:43:05.4793509Z siblings : 48 2025-05-07T19:43:05.4793705Z core id : 4 2025-05-07T19:43:05.4793884Z cpu cores : 24 2025-05-07T19:43:05.4794080Z apicid : 8 2025-05-07T19:43:05.4794261Z initial apicid : 8 2025-05-07T19:43:05.4794470Z fpu : yes 2025-05-07T19:43:05.4794651Z fpu_exception : yes 2025-05-07T19:43:05.4794863Z cpuid level : 13 2025-05-07T19:43:05.4795053Z wp : yes 2025-05-07T19:43:05.4797082Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4799452Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4799993Z bogomips : 5999.98 2025-05-07T19:43:05.4800194Z clflush size : 64 2025-05-07T19:43:05.4800411Z cache_alignment : 64 2025-05-07T19:43:05.4800665Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4800978Z power management: 2025-05-07T19:43:05.4801102Z 2025-05-07T19:43:05.4801182Z processor : 5 2025-05-07T19:43:05.4801389Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4801611Z cpu family : 6 2025-05-07T19:43:05.4801866Z model : 85 2025-05-07T19:43:05.4802120Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4802459Z stepping : 7 2025-05-07T19:43:05.4802659Z microcode : 0x5003901 2025-05-07T19:43:05.4802951Z cpu MHz : 3317.121 2025-05-07T19:43:05.4803167Z cache size : 36608 KB 2025-05-07T19:43:05.4803374Z physical id : 0 2025-05-07T19:43:05.4803585Z siblings : 48 2025-05-07T19:43:05.4803777Z core id : 5 2025-05-07T19:43:05.4803981Z cpu cores : 24 2025-05-07T19:43:05.4804171Z apicid : 10 2025-05-07T19:43:05.4804371Z initial apicid : 10 2025-05-07T19:43:05.4804565Z fpu : yes 2025-05-07T19:43:05.4804763Z fpu_exception : yes 2025-05-07T19:43:05.4804964Z cpuid level : 13 2025-05-07T19:43:05.4805170Z wp : yes 2025-05-07T19:43:05.4807202Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4809564Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4810091Z bogomips : 5999.98 2025-05-07T19:43:05.4810306Z clflush size : 64 2025-05-07T19:43:05.4810508Z cache_alignment : 64 2025-05-07T19:43:05.4810771Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4811070Z power management: 2025-05-07T19:43:05.4811202Z 2025-05-07T19:43:05.4811280Z processor : 6 2025-05-07T19:43:05.4811474Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4811705Z cpu family : 6 2025-05-07T19:43:05.4811893Z model : 85 2025-05-07T19:43:05.4812159Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4812542Z stepping : 7 2025-05-07T19:43:05.4812731Z microcode : 0x5003901 2025-05-07T19:43:05.4812954Z cpu MHz : 3306.913 2025-05-07T19:43:05.4813152Z cache size : 36608 KB 2025-05-07T19:43:05.4813375Z physical id : 0 2025-05-07T19:43:05.4813566Z siblings : 48 2025-05-07T19:43:05.4813764Z core id : 6 2025-05-07T19:43:05.4813948Z cpu cores : 24 2025-05-07T19:43:05.4814148Z apicid : 12 2025-05-07T19:43:05.4814338Z initial apicid : 12 2025-05-07T19:43:05.4814546Z fpu : yes 2025-05-07T19:43:05.4814730Z fpu_exception : yes 2025-05-07T19:43:05.4814943Z cpuid level : 13 2025-05-07T19:43:05.4815156Z wp : yes 2025-05-07T19:43:05.4817440Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4820092Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4820677Z bogomips : 5999.98 2025-05-07T19:43:05.4820890Z clflush size : 64 2025-05-07T19:43:05.4821117Z cache_alignment : 64 2025-05-07T19:43:05.4821384Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4821714Z power management: 2025-05-07T19:43:05.4821847Z 2025-05-07T19:43:05.4821930Z processor : 7 2025-05-07T19:43:05.4822154Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4822393Z cpu family : 6 2025-05-07T19:43:05.4822617Z model : 85 2025-05-07T19:43:05.4822899Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4823307Z stepping : 7 2025-05-07T19:43:05.4823525Z microcode : 0x5003901 2025-05-07T19:43:05.4823752Z cpu MHz : 3301.753 2025-05-07T19:43:05.4823984Z cache size : 36608 KB 2025-05-07T19:43:05.4824207Z physical id : 0 2025-05-07T19:43:05.4824431Z siblings : 48 2025-05-07T19:43:05.4824633Z core id : 7 2025-05-07T19:43:05.4824844Z cpu cores : 24 2025-05-07T19:43:05.4825047Z apicid : 14 2025-05-07T19:43:05.4825270Z initial apicid : 14 2025-05-07T19:43:05.4825485Z fpu : yes 2025-05-07T19:43:05.4825703Z fpu_exception : yes 2025-05-07T19:43:05.4825919Z cpuid level : 13 2025-05-07T19:43:05.4826138Z wp : yes 2025-05-07T19:43:05.4828339Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4830855Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4831381Z bogomips : 5999.98 2025-05-07T19:43:05.4831596Z clflush size : 64 2025-05-07T19:43:05.4831799Z cache_alignment : 64 2025-05-07T19:43:05.4832071Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4832370Z power management: 2025-05-07T19:43:05.4832509Z 2025-05-07T19:43:05.4832590Z processor : 8 2025-05-07T19:43:05.4832790Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4833024Z cpu family : 6 2025-05-07T19:43:05.4833213Z model : 85 2025-05-07T19:43:05.4833478Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4833808Z stepping : 7 2025-05-07T19:43:05.4834000Z microcode : 0x5003901 2025-05-07T19:43:05.4834217Z cpu MHz : 3325.830 2025-05-07T19:43:05.4834458Z cache size : 36608 KB 2025-05-07T19:43:05.4834676Z physical id : 0 2025-05-07T19:43:05.4834865Z siblings : 48 2025-05-07T19:43:05.4835057Z core id : 8 2025-05-07T19:43:05.4835238Z cpu cores : 24 2025-05-07T19:43:05.4835437Z apicid : 16 2025-05-07T19:43:05.4835623Z initial apicid : 16 2025-05-07T19:43:05.4835833Z fpu : yes 2025-05-07T19:43:05.4836019Z fpu_exception : yes 2025-05-07T19:43:05.4836235Z cpuid level : 13 2025-05-07T19:43:05.4836440Z wp : yes 2025-05-07T19:43:05.4838454Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4840810Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4841344Z bogomips : 5999.98 2025-05-07T19:43:05.4841542Z clflush size : 64 2025-05-07T19:43:05.4841754Z cache_alignment : 64 2025-05-07T19:43:05.4842002Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4842311Z power management: 2025-05-07T19:43:05.4842434Z 2025-05-07T19:43:05.4842510Z processor : 9 2025-05-07T19:43:05.4842718Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4842935Z cpu family : 6 2025-05-07T19:43:05.4843134Z model : 85 2025-05-07T19:43:05.4843398Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4843714Z stepping : 7 2025-05-07T19:43:05.4843917Z microcode : 0x5003901 2025-05-07T19:43:05.4844182Z cpu MHz : 3737.003 2025-05-07T19:43:05.4844394Z cache size : 36608 KB 2025-05-07T19:43:05.4844601Z physical id : 0 2025-05-07T19:43:05.4844809Z siblings : 48 2025-05-07T19:43:05.4844992Z core id : 9 2025-05-07T19:43:05.4845184Z cpu cores : 24 2025-05-07T19:43:05.4845369Z apicid : 18 2025-05-07T19:43:05.4845570Z initial apicid : 18 2025-05-07T19:43:05.4845766Z fpu : yes 2025-05-07T19:43:05.4845963Z fpu_exception : yes 2025-05-07T19:43:05.4846164Z cpuid level : 13 2025-05-07T19:43:05.4846365Z wp : yes 2025-05-07T19:43:05.4848393Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4850884Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4851406Z bogomips : 5999.98 2025-05-07T19:43:05.4851616Z clflush size : 64 2025-05-07T19:43:05.4851813Z cache_alignment : 64 2025-05-07T19:43:05.4852072Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4852365Z power management: 2025-05-07T19:43:05.4852498Z 2025-05-07T19:43:05.4852576Z processor : 10 2025-05-07T19:43:05.4852772Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4853002Z cpu family : 6 2025-05-07T19:43:05.4853184Z model : 85 2025-05-07T19:43:05.4853445Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4853775Z stepping : 7 2025-05-07T19:43:05.4853964Z microcode : 0x5003901 2025-05-07T19:43:05.4854183Z cpu MHz : 3313.888 2025-05-07T19:43:05.4854385Z cache size : 36608 KB 2025-05-07T19:43:05.4854602Z physical id : 0 2025-05-07T19:43:05.4854861Z siblings : 48 2025-05-07T19:43:05.4855058Z core id : 10 2025-05-07T19:43:05.4855242Z cpu cores : 24 2025-05-07T19:43:05.4855444Z apicid : 20 2025-05-07T19:43:05.4855632Z initial apicid : 20 2025-05-07T19:43:05.4855842Z fpu : yes 2025-05-07T19:43:05.4856028Z fpu_exception : yes 2025-05-07T19:43:05.4856242Z cpuid level : 13 2025-05-07T19:43:05.4856505Z wp : yes 2025-05-07T19:43:05.4858841Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4861416Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4862018Z bogomips : 5999.98 2025-05-07T19:43:05.4862252Z clflush size : 64 2025-05-07T19:43:05.4862504Z cache_alignment : 64 2025-05-07T19:43:05.4862789Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4863147Z power management: 2025-05-07T19:43:05.4863287Z 2025-05-07T19:43:05.4863378Z processor : 11 2025-05-07T19:43:05.4863641Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4863888Z cpu family : 6 2025-05-07T19:43:05.4864108Z model : 85 2025-05-07T19:43:05.4864418Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4864777Z stepping : 7 2025-05-07T19:43:05.4864995Z microcode : 0x5003901 2025-05-07T19:43:05.4865222Z cpu MHz : 3322.575 2025-05-07T19:43:05.4865448Z cache size : 36608 KB 2025-05-07T19:43:05.4865731Z physical id : 0 2025-05-07T19:43:05.4865954Z siblings : 48 2025-05-07T19:43:05.4866159Z core id : 11 2025-05-07T19:43:05.4866373Z cpu cores : 24 2025-05-07T19:43:05.4866575Z apicid : 22 2025-05-07T19:43:05.4866794Z initial apicid : 22 2025-05-07T19:43:05.4867006Z fpu : yes 2025-05-07T19:43:05.4867223Z fpu_exception : yes 2025-05-07T19:43:05.4867439Z cpuid level : 13 2025-05-07T19:43:05.4867655Z wp : yes 2025-05-07T19:43:05.4869852Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4872588Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4873169Z bogomips : 5999.98 2025-05-07T19:43:05.4873398Z clflush size : 64 2025-05-07T19:43:05.4873614Z cache_alignment : 64 2025-05-07T19:43:05.4873906Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4874222Z power management: 2025-05-07T19:43:05.4874367Z 2025-05-07T19:43:05.4874451Z processor : 12 2025-05-07T19:43:05.4874662Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4874908Z cpu family : 6 2025-05-07T19:43:05.4875107Z model : 85 2025-05-07T19:43:05.4875386Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4875741Z stepping : 7 2025-05-07T19:43:05.4875940Z microcode : 0x5003901 2025-05-07T19:43:05.4876194Z cpu MHz : 3335.111 2025-05-07T19:43:05.4876425Z cache size : 36608 KB 2025-05-07T19:43:05.4876684Z physical id : 0 2025-05-07T19:43:05.4876912Z siblings : 48 2025-05-07T19:43:05.4877154Z core id : 12 2025-05-07T19:43:05.4877463Z cpu cores : 24 2025-05-07T19:43:05.4877708Z apicid : 24 2025-05-07T19:43:05.4877931Z initial apicid : 24 2025-05-07T19:43:05.4878187Z fpu : yes 2025-05-07T19:43:05.4878406Z fpu_exception : yes 2025-05-07T19:43:05.4878658Z cpuid level : 13 2025-05-07T19:43:05.4878914Z wp : yes 2025-05-07T19:43:05.4881121Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4883703Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4884313Z bogomips : 5999.98 2025-05-07T19:43:05.4884548Z clflush size : 64 2025-05-07T19:43:05.4884802Z cache_alignment : 64 2025-05-07T19:43:05.4885086Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4885442Z power management: 2025-05-07T19:43:05.4885581Z 2025-05-07T19:43:05.4885673Z processor : 13 2025-05-07T19:43:05.4885924Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4886182Z cpu family : 6 2025-05-07T19:43:05.4886424Z model : 85 2025-05-07T19:43:05.4886731Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4887088Z stepping : 7 2025-05-07T19:43:05.4887336Z microcode : 0x5003901 2025-05-07T19:43:05.4887573Z cpu MHz : 3279.053 2025-05-07T19:43:05.4887822Z cache size : 36608 KB 2025-05-07T19:43:05.4888059Z physical id : 0 2025-05-07T19:43:05.4888300Z siblings : 48 2025-05-07T19:43:05.4891588Z core id : 13 2025-05-07T19:43:05.4891870Z cpu cores : 24 2025-05-07T19:43:05.4892091Z apicid : 26 2025-05-07T19:43:05.4892346Z initial apicid : 26 2025-05-07T19:43:05.4892580Z fpu : yes 2025-05-07T19:43:05.4892815Z fpu_exception : yes 2025-05-07T19:43:05.4893045Z cpuid level : 13 2025-05-07T19:43:05.4893291Z wp : yes 2025-05-07T19:43:05.4895517Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4898188Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4898778Z bogomips : 5999.98 2025-05-07T19:43:05.4899034Z clflush size : 64 2025-05-07T19:43:05.4899262Z cache_alignment : 64 2025-05-07T19:43:05.4899566Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4899903Z power management: 2025-05-07T19:43:05.4900067Z 2025-05-07T19:43:05.4900159Z processor : 14 2025-05-07T19:43:05.4900389Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4900663Z cpu family : 6 2025-05-07T19:43:05.4900878Z model : 85 2025-05-07T19:43:05.4901188Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4901573Z stepping : 7 2025-05-07T19:43:05.4901797Z microcode : 0x5003901 2025-05-07T19:43:05.4902068Z cpu MHz : 2999.994 2025-05-07T19:43:05.4902300Z cache size : 36608 KB 2025-05-07T19:43:05.4902563Z physical id : 0 2025-05-07T19:43:05.4902789Z siblings : 48 2025-05-07T19:43:05.4903035Z core id : 14 2025-05-07T19:43:05.4903254Z cpu cores : 24 2025-05-07T19:43:05.4903499Z apicid : 28 2025-05-07T19:43:05.4903713Z initial apicid : 28 2025-05-07T19:43:05.4904028Z fpu : yes 2025-05-07T19:43:05.4904244Z fpu_exception : yes 2025-05-07T19:43:05.4904499Z cpuid level : 13 2025-05-07T19:43:05.4904746Z wp : yes 2025-05-07T19:43:05.4906952Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4909543Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4910150Z bogomips : 5999.98 2025-05-07T19:43:05.4910393Z clflush size : 64 2025-05-07T19:43:05.4910657Z cache_alignment : 64 2025-05-07T19:43:05.4910948Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4911315Z power management: 2025-05-07T19:43:05.4911462Z 2025-05-07T19:43:05.4911558Z processor : 15 2025-05-07T19:43:05.4911819Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4912081Z cpu family : 6 2025-05-07T19:43:05.4912328Z model : 85 2025-05-07T19:43:05.4912643Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4913011Z stepping : 7 2025-05-07T19:43:05.4913262Z microcode : 0x5003901 2025-05-07T19:43:05.4913509Z cpu MHz : 3339.759 2025-05-07T19:43:05.4913769Z cache size : 36608 KB 2025-05-07T19:43:05.4914017Z physical id : 0 2025-05-07T19:43:05.4914541Z siblings : 48 2025-05-07T19:43:05.4914765Z core id : 15 2025-05-07T19:43:05.4915015Z cpu cores : 24 2025-05-07T19:43:05.4915238Z apicid : 30 2025-05-07T19:43:05.4915544Z initial apicid : 30 2025-05-07T19:43:05.4915779Z fpu : yes 2025-05-07T19:43:05.4916027Z fpu_exception : yes 2025-05-07T19:43:05.4916285Z cpuid level : 13 2025-05-07T19:43:05.4916513Z wp : yes 2025-05-07T19:43:05.4918743Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4921310Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4921894Z bogomips : 5999.98 2025-05-07T19:43:05.4922143Z clflush size : 64 2025-05-07T19:43:05.4922383Z cache_alignment : 64 2025-05-07T19:43:05.4922667Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4922996Z power management: 2025-05-07T19:43:05.4923155Z 2025-05-07T19:43:05.4923246Z processor : 16 2025-05-07T19:43:05.4923473Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4923750Z cpu family : 6 2025-05-07T19:43:05.4923964Z model : 85 2025-05-07T19:43:05.4924273Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4924656Z stepping : 7 2025-05-07T19:43:05.4924867Z microcode : 0x5003901 2025-05-07T19:43:05.4925131Z cpu MHz : 2999.994 2025-05-07T19:43:05.4925362Z cache size : 36608 KB 2025-05-07T19:43:05.4925636Z physical id : 0 2025-05-07T19:43:05.4925857Z siblings : 48 2025-05-07T19:43:05.4926099Z core id : 16 2025-05-07T19:43:05.4926310Z cpu cores : 24 2025-05-07T19:43:05.4926546Z apicid : 32 2025-05-07T19:43:05.4926766Z initial apicid : 32 2025-05-07T19:43:05.4927018Z fpu : yes 2025-05-07T19:43:05.4927229Z fpu_exception : yes 2025-05-07T19:43:05.4927631Z cpuid level : 13 2025-05-07T19:43:05.4927864Z wp : yes 2025-05-07T19:43:05.4929992Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4932750Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4933356Z bogomips : 5999.98 2025-05-07T19:43:05.4933587Z clflush size : 64 2025-05-07T19:43:05.4933840Z cache_alignment : 64 2025-05-07T19:43:05.4934133Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4934489Z power management: 2025-05-07T19:43:05.4934628Z 2025-05-07T19:43:05.4934719Z processor : 17 2025-05-07T19:43:05.4934973Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4935222Z cpu family : 6 2025-05-07T19:43:05.4935460Z model : 85 2025-05-07T19:43:05.4935767Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4936125Z stepping : 7 2025-05-07T19:43:05.4936366Z microcode : 0x5003901 2025-05-07T19:43:05.4936677Z cpu MHz : 2999.994 2025-05-07T19:43:05.4936936Z cache size : 36608 KB 2025-05-07T19:43:05.4937176Z physical id : 0 2025-05-07T19:43:05.4937429Z siblings : 48 2025-05-07T19:43:05.4937642Z core id : 17 2025-05-07T19:43:05.4937969Z cpu cores : 24 2025-05-07T19:43:05.4938188Z apicid : 34 2025-05-07T19:43:05.4938432Z initial apicid : 34 2025-05-07T19:43:05.4938659Z fpu : yes 2025-05-07T19:43:05.4938951Z fpu_exception : yes 2025-05-07T19:43:05.4939205Z cpuid level : 13 2025-05-07T19:43:05.4939431Z wp : yes 2025-05-07T19:43:05.4941654Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4944229Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4944809Z bogomips : 5999.98 2025-05-07T19:43:05.4945069Z clflush size : 64 2025-05-07T19:43:05.4945304Z cache_alignment : 64 2025-05-07T19:43:05.4945611Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4945948Z power management: 2025-05-07T19:43:05.4946108Z 2025-05-07T19:43:05.4946200Z processor : 18 2025-05-07T19:43:05.4946433Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4946713Z cpu family : 6 2025-05-07T19:43:05.4946951Z model : 85 2025-05-07T19:43:05.4947244Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4947641Z stepping : 7 2025-05-07T19:43:05.4947866Z microcode : 0x5003901 2025-05-07T19:43:05.4948130Z cpu MHz : 2999.994 2025-05-07T19:43:05.4948448Z cache size : 36608 KB 2025-05-07T19:43:05.4948703Z physical id : 0 2025-05-07T19:43:05.4948907Z siblings : 48 2025-05-07T19:43:05.4949116Z core id : 18 2025-05-07T19:43:05.4949313Z cpu cores : 24 2025-05-07T19:43:05.4949526Z apicid : 36 2025-05-07T19:43:05.4949738Z initial apicid : 36 2025-05-07T19:43:05.4949964Z fpu : yes 2025-05-07T19:43:05.4950159Z fpu_exception : yes 2025-05-07T19:43:05.4950390Z cpuid level : 13 2025-05-07T19:43:05.4950606Z wp : yes 2025-05-07T19:43:05.4952869Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4955572Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4956155Z bogomips : 5999.98 2025-05-07T19:43:05.4956368Z clflush size : 64 2025-05-07T19:43:05.4956597Z cache_alignment : 64 2025-05-07T19:43:05.4956869Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4957203Z power management: 2025-05-07T19:43:05.4957339Z 2025-05-07T19:43:05.4957422Z processor : 19 2025-05-07T19:43:05.4957654Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4957890Z cpu family : 6 2025-05-07T19:43:05.4958104Z model : 85 2025-05-07T19:43:05.4958382Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4981366Z stepping : 7 2025-05-07T19:43:05.4981861Z microcode : 0x5003901 2025-05-07T19:43:05.4982158Z cpu MHz : 2999.994 2025-05-07T19:43:05.4982379Z cache size : 36608 KB 2025-05-07T19:43:05.4982691Z physical id : 0 2025-05-07T19:43:05.4983078Z siblings : 48 2025-05-07T19:43:05.4983282Z core id : 19 2025-05-07T19:43:05.4983486Z cpu cores : 24 2025-05-07T19:43:05.4983690Z apicid : 38 2025-05-07T19:43:05.4983908Z initial apicid : 38 2025-05-07T19:43:05.4984118Z fpu : yes 2025-05-07T19:43:05.4984329Z fpu_exception : yes 2025-05-07T19:43:05.4984543Z cpuid level : 13 2025-05-07T19:43:05.4984763Z wp : yes 2025-05-07T19:43:05.4987144Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.4989782Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.4990350Z bogomips : 5999.98 2025-05-07T19:43:05.4990578Z clflush size : 64 2025-05-07T19:43:05.4990789Z cache_alignment : 64 2025-05-07T19:43:05.4991071Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.4991386Z power management: 2025-05-07T19:43:05.4991518Z 2025-05-07T19:43:05.4991609Z processor : 20 2025-05-07T19:43:05.4991817Z vendor_id : GenuineIntel 2025-05-07T19:43:05.4992065Z cpu family : 6 2025-05-07T19:43:05.4992257Z model : 85 2025-05-07T19:43:05.4992533Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.4992874Z stepping : 7 2025-05-07T19:43:05.4993085Z microcode : 0x5003901 2025-05-07T19:43:05.4993319Z cpu MHz : 2999.994 2025-05-07T19:43:05.4993525Z cache size : 36608 KB 2025-05-07T19:43:05.4993754Z physical id : 0 2025-05-07T19:43:05.4994057Z siblings : 48 2025-05-07T19:43:05.4994255Z core id : 20 2025-05-07T19:43:05.4994438Z cpu cores : 24 2025-05-07T19:43:05.4994633Z apicid : 40 2025-05-07T19:43:05.4994812Z initial apicid : 40 2025-05-07T19:43:05.4995012Z fpu : yes 2025-05-07T19:43:05.4995184Z fpu_exception : yes 2025-05-07T19:43:05.4995396Z cpuid level : 13 2025-05-07T19:43:05.4995585Z wp : yes 2025-05-07T19:43:05.4997618Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5000057Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5000584Z bogomips : 5999.98 2025-05-07T19:43:05.5000805Z clflush size : 64 2025-05-07T19:43:05.5001011Z cache_alignment : 64 2025-05-07T19:43:05.5001285Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5001589Z power management: 2025-05-07T19:43:05.5001732Z 2025-05-07T19:43:05.5001814Z processor : 21 2025-05-07T19:43:05.5002033Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5002254Z cpu family : 6 2025-05-07T19:43:05.5002460Z model : 85 2025-05-07T19:43:05.5002717Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5003045Z stepping : 7 2025-05-07T19:43:05.5003220Z microcode : 0x5003901 2025-05-07T19:43:05.5003414Z cpu MHz : 2999.994 2025-05-07T19:43:05.5003594Z cache size : 36608 KB 2025-05-07T19:43:05.5003785Z physical id : 0 2025-05-07T19:43:05.5003970Z siblings : 48 2025-05-07T19:43:05.5004158Z core id : 21 2025-05-07T19:43:05.5004337Z cpu cores : 24 2025-05-07T19:43:05.5004532Z apicid : 42 2025-05-07T19:43:05.5004727Z initial apicid : 42 2025-05-07T19:43:05.5004941Z fpu : yes 2025-05-07T19:43:05.5005609Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.5005913Z fpu_exception : yes 2025-05-07T19:43:05.5006132Z cpuid level : 13 2025-05-07T19:43:05.5006324Z wp : yes 2025-05-07T19:43:05.5008415Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.5010770Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5011282Z bogomips : 5999.98 2025-05-07T19:43:05.5011474Z clflush size : 64 2025-05-07T19:43:05.5011661Z cache_alignment : 64 2025-05-07T19:43:05.5011909Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5012210Z power management: 2025-05-07T19:43:05.5012327Z 2025-05-07T19:43:05.5012400Z processor : 22 2025-05-07T19:43:05.5012602Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5012806Z cpu family : 6 2025-05-07T19:43:05.5012992Z model : 85 2025-05-07T19:43:05.5013231Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5013548Z stepping : 7 2025-05-07T19:43:05.5013726Z microcode : 0x5003901 2025-05-07T19:43:05.5013930Z cpu MHz : 2999.994 2025-05-07T19:43:05.5014117Z cache size : 36608 KB 2025-05-07T19:43:05.5014325Z physical id : 0 2025-05-07T19:43:05.5014508Z siblings : 48 2025-05-07T19:43:05.5014689Z core id : 22 2025-05-07T19:43:05.5014874Z cpu cores : 24 2025-05-07T19:43:05.5015049Z apicid : 44 2025-05-07T19:43:05.5015234Z initial apicid : 44 2025-05-07T19:43:05.5015424Z fpu : yes 2025-05-07T19:43:05.5015605Z fpu_exception : yes 2025-05-07T19:43:05.5015798Z cpuid level : 13 2025-05-07T19:43:05.5015991Z wp : yes 2025-05-07T19:43:05.5018339Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.5020941Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5021512Z bogomips : 5999.98 2025-05-07T19:43:05.5021716Z clflush size : 64 2025-05-07T19:43:05.5021929Z cache_alignment : 64 2025-05-07T19:43:05.5022187Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5022502Z power management: 2025-05-07T19:43:05.5022626Z 2025-05-07T19:43:05.5022715Z processor : 23 2025-05-07T19:43:05.5022916Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5023147Z cpu family : 6 2025-05-07T19:43:05.5023336Z model : 85 2025-05-07T19:43:05.5023603Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5023937Z stepping : 7 2025-05-07T19:43:05.5024144Z microcode : 0x5003901 2025-05-07T19:43:05.5024361Z cpu MHz : 2999.994 2025-05-07T19:43:05.5024571Z cache size : 36608 KB 2025-05-07T19:43:05.5024784Z physical id : 0 2025-05-07T19:43:05.5024988Z siblings : 48 2025-05-07T19:43:05.5025179Z core id : 23 2025-05-07T19:43:05.5025365Z cpu cores : 24 2025-05-07T19:43:05.5025565Z apicid : 46 2025-05-07T19:43:05.5025754Z initial apicid : 46 2025-05-07T19:43:05.5025959Z fpu : yes 2025-05-07T19:43:05.5026146Z fpu_exception : yes 2025-05-07T19:43:05.5026357Z cpuid level : 13 2025-05-07T19:43:05.5026546Z wp : yes 2025-05-07T19:43:05.5028781Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.5031258Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5031773Z bogomips : 5999.98 2025-05-07T19:43:05.5031973Z clflush size : 64 2025-05-07T19:43:05.5032157Z cache_alignment : 64 2025-05-07T19:43:05.5032407Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5032700Z power management: 2025-05-07T19:43:05.5032816Z 2025-05-07T19:43:05.5032886Z processor : 24 2025-05-07T19:43:05.5033079Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5033289Z cpu family : 6 2025-05-07T19:43:05.5033472Z model : 85 2025-05-07T19:43:05.5033711Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5034027Z stepping : 7 2025-05-07T19:43:05.5034208Z microcode : 0x5003901 2025-05-07T19:43:05.5034415Z cpu MHz : 1443.979 2025-05-07T19:43:05.5034603Z cache size : 36608 KB 2025-05-07T19:43:05.5034810Z physical id : 1 2025-05-07T19:43:05.5034991Z siblings : 48 2025-05-07T19:43:05.5035171Z core id : 0 2025-05-07T19:43:05.5035352Z cpu cores : 24 2025-05-07T19:43:05.5035526Z apicid : 64 2025-05-07T19:43:05.5035711Z initial apicid : 64 2025-05-07T19:43:05.5035895Z fpu : yes 2025-05-07T19:43:05.5036075Z fpu_exception : yes 2025-05-07T19:43:05.5036259Z cpuid level : 13 2025-05-07T19:43:05.5036442Z wp : yes 2025-05-07T19:43:05.5038455Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.5040850Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5041388Z bogomips : 5999.98 2025-05-07T19:43:05.5041601Z clflush size : 64 2025-05-07T19:43:05.5041806Z cache_alignment : 64 2025-05-07T19:43:05.5042202Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5042498Z power management: 2025-05-07T19:43:05.5042619Z 2025-05-07T19:43:05.5042711Z processor : 25 2025-05-07T19:43:05.5042902Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5043129Z cpu family : 6 2025-05-07T19:43:05.5043309Z model : 85 2025-05-07T19:43:05.5043578Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5043899Z stepping : 7 2025-05-07T19:43:05.5044104Z microcode : 0x5003901 2025-05-07T19:43:05.5044319Z cpu MHz : 1200.605 2025-05-07T19:43:05.5044516Z cache size : 36608 KB 2025-05-07T19:43:05.5044734Z physical id : 1 2025-05-07T19:43:05.5044921Z siblings : 48 2025-05-07T19:43:05.5045122Z core id : 1 2025-05-07T19:43:05.5045302Z cpu cores : 24 2025-05-07T19:43:05.5045506Z apicid : 66 2025-05-07T19:43:05.5045682Z initial apicid : 66 2025-05-07T19:43:05.5045891Z fpu : yes 2025-05-07T19:43:05.5046068Z fpu_exception : yes 2025-05-07T19:43:05.5046268Z cpuid level : 13 2025-05-07T19:43:05.5046447Z wp : yes 2025-05-07T19:43:05.5048534Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.5050874Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5051413Z bogomips : 5999.98 2025-05-07T19:43:05.5051608Z clflush size : 64 2025-05-07T19:43:05.5051822Z cache_alignment : 64 2025-05-07T19:43:05.5052058Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5052367Z power management: 2025-05-07T19:43:05.5052486Z 2025-05-07T19:43:05.5052564Z processor : 26 2025-05-07T19:43:05.5052774Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5052984Z cpu family : 6 2025-05-07T19:43:05.5053180Z model : 85 2025-05-07T19:43:05.5053427Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5053758Z stepping : 7 2025-05-07T19:43:05.5053958Z microcode : 0x5003901 2025-05-07T19:43:05.5054163Z cpu MHz : 1200.956 2025-05-07T19:43:05.5054377Z cache size : 36608 KB 2025-05-07T19:43:05.5054580Z physical id : 1 2025-05-07T19:43:05.5054780Z siblings : 48 2025-05-07T19:43:05.5054958Z core id : 2 2025-05-07T19:43:05.5055144Z cpu cores : 24 2025-05-07T19:43:05.5055318Z apicid : 68 2025-05-07T19:43:05.5055517Z initial apicid : 68 2025-05-07T19:43:05.5055702Z fpu : yes 2025-05-07T19:43:05.5055890Z fpu_exception : yes 2025-05-07T19:43:05.5056079Z cpuid level : 13 2025-05-07T19:43:05.5056276Z wp : yes 2025-05-07T19:43:05.5058648Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:43:05.5061228Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5061824Z bogomips : 5999.98 2025-05-07T19:43:05.5062050Z clflush size : 64 2025-05-07T19:43:05.5062266Z cache_alignment : 64 2025-05-07T19:43:05.5062548Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5062866Z power management: 2025-05-07T19:43:05.5062993Z 2025-05-07T19:43:05.5063089Z processor : 27 2025-05-07T19:43:05.5063297Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5063543Z cpu family : 6 2025-05-07T19:43:05.5063735Z model : 85 2025-05-07T19:43:05.5064022Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5064367Z stepping : 7 2025-05-07T19:43:05.5064582Z microcode : 0x5003901 2025-05-07T19:43:05.5064801Z cpu MHz : 2999.994 2025-05-07T19:43:05.5065005Z cache size : 36608 KB 2025-05-07T19:43:05.5065226Z physical id : 1 2025-05-07T19:43:05.5065424Z siblings : 48 2025-05-07T19:43:05.5065625Z core id : 3 2025-05-07T19:43:05.5065808Z cpu cores : 24 2025-05-07T19:43:05.5066012Z apicid : 70 2025-05-07T19:43:05.5066210Z initial apicid : 70 2025-05-07T19:43:05.5066428Z fpu : yes 2025-05-07T19:43:05.5066620Z fpu_exception : yes 2025-05-07T19:43:05.5066846Z cpuid level : 13 2025-05-07T19:43:05.5067039Z wp : yes 2025-05-07T19:43:05.5069265Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:05.5071951Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5072524Z bogomips : 5999.98 2025-05-07T19:43:05.5072737Z clflush size : 64 2025-05-07T19:43:05.5072968Z cache_alignment : 64 2025-05-07T19:43:05.5073227Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5073554Z power management: 2025-05-07T19:43:05.5073686Z 2025-05-07T19:43:05.5073768Z processor : 28 2025-05-07T19:43:05.5073989Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5074218Z cpu family : 6 2025-05-07T19:43:05.5074426Z model : 85 2025-05-07T19:43:05.5074689Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5075045Z stepping : 7 2025-05-07T19:43:05.5075253Z microcode : 0x5003901 2025-05-07T19:43:05.5075483Z cpu MHz : 1199.437 2025-05-07T19:43:05.5075706Z cache size : 36608 KB 2025-05-07T19:43:05.5075926Z physical id : 1 2025-05-07T19:43:05.5076125Z siblings : 48 2025-05-07T19:43:05.5076311Z core id : 4 2025-05-07T19:43:05.5076504Z cpu cores : 24 2025-05-07T19:43:05.5076689Z apicid : 72 2025-05-07T19:43:05.5076890Z initial apicid : 72 2025-05-07T19:43:05.5077086Z fpu : yes 2025-05-07T19:43:05.5077278Z fpu_exception : yes 2025-05-07T19:43:05.5077478Z cpuid level : 13 2025-05-07T19:43:05.5077683Z wp : yes 2025-05-07T19:43:05.5079857Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:05.5082570Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5083096Z bogomips : 5999.98 2025-05-07T19:43:05.5083288Z clflush size : 64 2025-05-07T19:43:05.5083475Z cache_alignment : 64 2025-05-07T19:43:05.5083717Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5084002Z power management: 2025-05-07T19:43:05.5084122Z 2025-05-07T19:43:05.5084195Z processor : 29 2025-05-07T19:43:05.5084379Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5084592Z cpu family : 6 2025-05-07T19:43:05.5084763Z model : 85 2025-05-07T19:43:05.5085009Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5085324Z stepping : 7 2025-05-07T19:43:05.5085530Z microcode : 0x5003901 2025-05-07T19:43:05.5085749Z cpu MHz : 1199.514 2025-05-07T19:43:05.5085948Z cache size : 36608 KB 2025-05-07T19:43:05.5086169Z physical id : 1 2025-05-07T19:43:05.5086373Z siblings : 48 2025-05-07T19:43:05.5086599Z core id : 5 2025-05-07T19:43:05.5086790Z cpu cores : 24 2025-05-07T19:43:05.5086987Z apicid : 74 2025-05-07T19:43:05.5087173Z initial apicid : 74 2025-05-07T19:43:05.5087379Z fpu : yes 2025-05-07T19:43:05.5087556Z fpu_exception : yes 2025-05-07T19:43:05.5087762Z cpuid level : 13 2025-05-07T19:43:05.5087947Z wp : yes 2025-05-07T19:43:05.5090071Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5092419Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5092955Z bogomips : 5999.98 2025-05-07T19:43:05.5093149Z clflush size : 64 2025-05-07T19:43:05.5093360Z cache_alignment : 64 2025-05-07T19:43:05.5093604Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5093911Z power management: 2025-05-07T19:43:05.5094030Z 2025-05-07T19:43:05.5094107Z processor : 30 2025-05-07T19:43:05.5094319Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5094539Z cpu family : 6 2025-05-07T19:43:05.5094737Z model : 85 2025-05-07T19:43:05.5094986Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5095318Z stepping : 7 2025-05-07T19:43:05.5095529Z microcode : 0x5003901 2025-05-07T19:43:05.5095743Z cpu MHz : 1200.376 2025-05-07T19:43:05.5095980Z cache size : 36608 KB 2025-05-07T19:43:05.5096206Z physical id : 1 2025-05-07T19:43:05.5096501Z siblings : 48 2025-05-07T19:43:05.5096715Z core id : 6 2025-05-07T19:43:05.5097109Z cpu cores : 24 2025-05-07T19:43:05.5097331Z apicid : 76 2025-05-07T19:43:05.5097581Z initial apicid : 76 2025-05-07T19:43:05.5097858Z fpu : yes 2025-05-07T19:43:05.5098103Z fpu_exception : yes 2025-05-07T19:43:05.5098344Z cpuid level : 13 2025-05-07T19:43:05.5098598Z wp : yes 2025-05-07T19:43:05.5100833Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5103433Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5104046Z bogomips : 5999.98 2025-05-07T19:43:05.5104306Z clflush size : 64 2025-05-07T19:43:05.5104539Z cache_alignment : 64 2025-05-07T19:43:05.5104854Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5105194Z power management: 2025-05-07T19:43:05.5105360Z 2025-05-07T19:43:05.5105454Z processor : 31 2025-05-07T19:43:05.5105684Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5105957Z cpu family : 6 2025-05-07T19:43:05.5106171Z model : 85 2025-05-07T19:43:05.5106474Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5106831Z stepping : 7 2025-05-07T19:43:05.5107073Z microcode : 0x5003901 2025-05-07T19:43:05.5107335Z cpu MHz : 1199.941 2025-05-07T19:43:05.5107570Z cache size : 36608 KB 2025-05-07T19:43:05.5107839Z physical id : 1 2025-05-07T19:43:05.5108061Z siblings : 48 2025-05-07T19:43:05.5108294Z core id : 7 2025-05-07T19:43:05.5108506Z cpu cores : 24 2025-05-07T19:43:05.5108739Z apicid : 78 2025-05-07T19:43:05.5108956Z initial apicid : 78 2025-05-07T19:43:05.5109311Z fpu : yes 2025-05-07T19:43:05.5109508Z fpu_exception : yes 2025-05-07T19:43:05.5109743Z cpuid level : 13 2025-05-07T19:43:05.5109945Z wp : yes 2025-05-07T19:43:05.5112047Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5114435Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5115003Z bogomips : 5999.98 2025-05-07T19:43:05.5115217Z clflush size : 64 2025-05-07T19:43:05.5115467Z cache_alignment : 64 2025-05-07T19:43:05.5115736Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5116070Z power management: 2025-05-07T19:43:05.5116200Z 2025-05-07T19:43:05.5116288Z processor : 32 2025-05-07T19:43:05.5116533Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5116775Z cpu family : 6 2025-05-07T19:43:05.5117000Z model : 85 2025-05-07T19:43:05.5117272Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5117626Z stepping : 7 2025-05-07T19:43:05.5117854Z microcode : 0x5003901 2025-05-07T19:43:05.5118079Z cpu MHz : 1200.019 2025-05-07T19:43:05.5118317Z cache size : 36608 KB 2025-05-07T19:43:05.5118544Z physical id : 1 2025-05-07T19:43:05.5118774Z siblings : 48 2025-05-07T19:43:05.5118976Z core id : 8 2025-05-07T19:43:05.5119166Z cpu cores : 24 2025-05-07T19:43:05.5119340Z apicid : 80 2025-05-07T19:43:05.5119527Z initial apicid : 80 2025-05-07T19:43:05.5119707Z fpu : yes 2025-05-07T19:43:05.5119889Z fpu_exception : yes 2025-05-07T19:43:05.5120076Z cpuid level : 13 2025-05-07T19:43:05.5120266Z wp : yes 2025-05-07T19:43:05.5122275Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5124606Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5125185Z bogomips : 5999.98 2025-05-07T19:43:05.5125380Z clflush size : 64 2025-05-07T19:43:05.5125570Z cache_alignment : 64 2025-05-07T19:43:05.5125816Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5126105Z power management: 2025-05-07T19:43:05.5126230Z 2025-05-07T19:43:05.5126303Z processor : 33 2025-05-07T19:43:05.5126495Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5126714Z cpu family : 6 2025-05-07T19:43:05.5126886Z model : 85 2025-05-07T19:43:05.5127137Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5127442Z stepping : 7 2025-05-07T19:43:05.5127630Z microcode : 0x5003901 2025-05-07T19:43:05.5127832Z cpu MHz : 2999.994 2025-05-07T19:43:05.5128019Z cache size : 36608 KB 2025-05-07T19:43:05.5128223Z physical id : 1 2025-05-07T19:43:05.5128406Z siblings : 48 2025-05-07T19:43:05.5128594Z core id : 9 2025-05-07T19:43:05.5128762Z cpu cores : 24 2025-05-07T19:43:05.5128951Z apicid : 82 2025-05-07T19:43:05.5129125Z initial apicid : 82 2025-05-07T19:43:05.5129325Z fpu : yes 2025-05-07T19:43:05.5129490Z fpu_exception : yes 2025-05-07T19:43:05.5129687Z cpuid level : 13 2025-05-07T19:43:05.5129863Z wp : yes 2025-05-07T19:43:05.5131873Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5134263Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5134785Z bogomips : 5999.98 2025-05-07T19:43:05.5134973Z clflush size : 64 2025-05-07T19:43:05.5135171Z cache_alignment : 64 2025-05-07T19:43:05.5135409Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5135700Z power management: 2025-05-07T19:43:05.5135818Z 2025-05-07T19:43:05.5135891Z processor : 34 2025-05-07T19:43:05.5136088Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5136290Z cpu family : 6 2025-05-07T19:43:05.5136575Z model : 85 2025-05-07T19:43:05.5136815Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5137326Z stepping : 7 2025-05-07T19:43:05.5137529Z microcode : 0x5003901 2025-05-07T19:43:05.5137813Z cpu MHz : 1200.086 2025-05-07T19:43:05.5138022Z cache size : 36608 KB 2025-05-07T19:43:05.5138237Z physical id : 1 2025-05-07T19:43:05.5138447Z siblings : 48 2025-05-07T19:43:05.5138638Z core id : 10 2025-05-07T19:43:05.5138841Z cpu cores : 24 2025-05-07T19:43:05.5139030Z apicid : 84 2025-05-07T19:43:05.5139232Z initial apicid : 84 2025-05-07T19:43:05.5139428Z fpu : yes 2025-05-07T19:43:05.5139623Z fpu_exception : yes 2025-05-07T19:43:05.5139823Z cpuid level : 13 2025-05-07T19:43:05.5140023Z wp : yes 2025-05-07T19:43:05.5142209Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5142586Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5142725Z bogomips : 5999.98 2025-05-07T19:43:05.5142816Z clflush size : 64 2025-05-07T19:43:05.5142895Z cache_alignment : 64 2025-05-07T19:43:05.5143022Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5143110Z power management: 2025-05-07T19:43:05.5143115Z 2025-05-07T19:43:05.5143193Z processor : 35 2025-05-07T19:43:05.5143281Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5143362Z cpu family : 6 2025-05-07T19:43:05.5143449Z model : 85 2025-05-07T19:43:05.5143606Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5143686Z stepping : 7 2025-05-07T19:43:05.5143778Z microcode : 0x5003901 2025-05-07T19:43:05.5143855Z cpu MHz : 1200.207 2025-05-07T19:43:05.5143938Z cache size : 36608 KB 2025-05-07T19:43:05.5144017Z physical id : 1 2025-05-07T19:43:05.5144103Z siblings : 48 2025-05-07T19:43:05.5144179Z core id : 11 2025-05-07T19:43:05.5144255Z cpu cores : 24 2025-05-07T19:43:05.5144332Z apicid : 86 2025-05-07T19:43:05.5144424Z initial apicid : 86 2025-05-07T19:43:05.5144503Z fpu : yes 2025-05-07T19:43:05.5144586Z fpu_exception : yes 2025-05-07T19:43:05.5144674Z cpuid level : 13 2025-05-07T19:43:05.5144750Z wp : yes 2025-05-07T19:43:05.5146818Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5147365Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5147447Z bogomips : 5999.98 2025-05-07T19:43:05.5147531Z clflush size : 64 2025-05-07T19:43:05.5147622Z cache_alignment : 64 2025-05-07T19:43:05.5147749Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5147832Z power management: 2025-05-07T19:43:05.5147836Z 2025-05-07T19:43:05.5147925Z processor : 36 2025-05-07T19:43:05.5148010Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5148088Z cpu family : 6 2025-05-07T19:43:05.5148168Z model : 85 2025-05-07T19:43:05.5148333Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5148412Z stepping : 7 2025-05-07T19:43:05.5148494Z microcode : 0x5003901 2025-05-07T19:43:05.5148583Z cpu MHz : 1200.147 2025-05-07T19:43:05.5148663Z cache size : 36608 KB 2025-05-07T19:43:05.5148743Z physical id : 1 2025-05-07T19:43:05.5148818Z siblings : 48 2025-05-07T19:43:05.5148906Z core id : 12 2025-05-07T19:43:05.5148982Z cpu cores : 24 2025-05-07T19:43:05.5149165Z apicid : 88 2025-05-07T19:43:05.5149241Z initial apicid : 88 2025-05-07T19:43:05.5149316Z fpu : yes 2025-05-07T19:43:05.5149395Z fpu_exception : yes 2025-05-07T19:43:05.5149468Z cpuid level : 13 2025-05-07T19:43:05.5149548Z wp : yes 2025-05-07T19:43:05.5151454Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5151797Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5151883Z bogomips : 5999.98 2025-05-07T19:43:05.5151955Z clflush size : 64 2025-05-07T19:43:05.5152075Z cache_alignment : 64 2025-05-07T19:43:05.5152199Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5152274Z power management: 2025-05-07T19:43:05.5152279Z 2025-05-07T19:43:05.5152348Z processor : 37 2025-05-07T19:43:05.5152426Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5152507Z cpu family : 6 2025-05-07T19:43:05.5152576Z model : 85 2025-05-07T19:43:05.5152716Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5152792Z stepping : 7 2025-05-07T19:43:05.5152866Z microcode : 0x5003901 2025-05-07T19:43:05.5152936Z cpu MHz : 2999.994 2025-05-07T19:43:05.5153011Z cache size : 36608 KB 2025-05-07T19:43:05.5153095Z physical id : 1 2025-05-07T19:43:05.5153163Z siblings : 48 2025-05-07T19:43:05.5153234Z core id : 13 2025-05-07T19:43:05.5153328Z cpu cores : 24 2025-05-07T19:43:05.5153402Z apicid : 90 2025-05-07T19:43:05.5153480Z initial apicid : 90 2025-05-07T19:43:05.5153558Z fpu : yes 2025-05-07T19:43:05.5153653Z fpu_exception : yes 2025-05-07T19:43:05.5153732Z cpuid level : 13 2025-05-07T19:43:05.5153806Z wp : yes 2025-05-07T19:43:05.5155742Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5156089Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5156168Z bogomips : 5999.98 2025-05-07T19:43:05.5156306Z clflush size : 64 2025-05-07T19:43:05.5156389Z cache_alignment : 64 2025-05-07T19:43:05.5156515Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5156607Z power management: 2025-05-07T19:43:05.5156612Z 2025-05-07T19:43:05.5156688Z processor : 38 2025-05-07T19:43:05.5156768Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5156839Z cpu family : 6 2025-05-07T19:43:05.5156929Z model : 85 2025-05-07T19:43:05.5157080Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5157156Z stepping : 7 2025-05-07T19:43:05.5157250Z microcode : 0x5003901 2025-05-07T19:43:05.5157323Z cpu MHz : 1248.017 2025-05-07T19:43:05.5157400Z cache size : 36608 KB 2025-05-07T19:43:05.5157472Z physical id : 1 2025-05-07T19:43:05.5157552Z siblings : 48 2025-05-07T19:43:05.5157626Z core id : 14 2025-05-07T19:43:05.5157699Z cpu cores : 24 2025-05-07T19:43:05.5157774Z apicid : 92 2025-05-07T19:43:05.5157851Z initial apicid : 92 2025-05-07T19:43:05.5157922Z fpu : yes 2025-05-07T19:43:05.5158001Z fpu_exception : yes 2025-05-07T19:43:05.5158082Z cpuid level : 13 2025-05-07T19:43:05.5158159Z wp : yes 2025-05-07T19:43:05.5160059Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5160413Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5160486Z bogomips : 5999.98 2025-05-07T19:43:05.5160560Z clflush size : 64 2025-05-07T19:43:05.5160648Z cache_alignment : 64 2025-05-07T19:43:05.5160764Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5162662Z power management: 2025-05-07T19:43:05.5162667Z 2025-05-07T19:43:05.5162745Z processor : 39 2025-05-07T19:43:05.5162827Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5162901Z cpu family : 6 2025-05-07T19:43:05.5162973Z model : 85 2025-05-07T19:43:05.5163129Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5163202Z stepping : 7 2025-05-07T19:43:05.5163279Z microcode : 0x5003901 2025-05-07T19:43:05.5163358Z cpu MHz : 2999.994 2025-05-07T19:43:05.5163434Z cache size : 36608 KB 2025-05-07T19:43:05.5163509Z physical id : 1 2025-05-07T19:43:05.5163580Z siblings : 48 2025-05-07T19:43:05.5163659Z core id : 15 2025-05-07T19:43:05.5163731Z cpu cores : 24 2025-05-07T19:43:05.5163803Z apicid : 94 2025-05-07T19:43:05.5163887Z initial apicid : 94 2025-05-07T19:43:05.5163956Z fpu : yes 2025-05-07T19:43:05.5164030Z fpu_exception : yes 2025-05-07T19:43:05.5164109Z cpuid level : 13 2025-05-07T19:43:05.5164187Z wp : yes 2025-05-07T19:43:05.5166108Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5166459Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5166533Z bogomips : 5999.98 2025-05-07T19:43:05.5166604Z clflush size : 64 2025-05-07T19:43:05.5166681Z cache_alignment : 64 2025-05-07T19:43:05.5166856Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5166932Z power management: 2025-05-07T19:43:05.5166940Z 2025-05-07T19:43:05.5167011Z processor : 40 2025-05-07T19:43:05.5167100Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5167173Z cpu family : 6 2025-05-07T19:43:05.5167242Z model : 85 2025-05-07T19:43:05.5167385Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5167466Z stepping : 7 2025-05-07T19:43:05.5167542Z microcode : 0x5003901 2025-05-07T19:43:05.5167611Z cpu MHz : 2999.994 2025-05-07T19:43:05.5167697Z cache size : 36608 KB 2025-05-07T19:43:05.5167770Z physical id : 1 2025-05-07T19:43:05.5167849Z siblings : 48 2025-05-07T19:43:05.5167923Z core id : 16 2025-05-07T19:43:05.5168015Z cpu cores : 24 2025-05-07T19:43:05.5168096Z apicid : 96 2025-05-07T19:43:05.5168176Z initial apicid : 96 2025-05-07T19:43:05.5168269Z fpu : yes 2025-05-07T19:43:05.5168349Z fpu_exception : yes 2025-05-07T19:43:05.5168429Z cpuid level : 13 2025-05-07T19:43:05.5168505Z wp : yes 2025-05-07T19:43:05.5170722Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5171248Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5171345Z bogomips : 5999.98 2025-05-07T19:43:05.5171430Z clflush size : 64 2025-05-07T19:43:05.5171574Z cache_alignment : 64 2025-05-07T19:43:05.5171702Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5171802Z power management: 2025-05-07T19:43:05.5171807Z 2025-05-07T19:43:05.5171992Z processor : 41 2025-05-07T19:43:05.5172083Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5172178Z cpu family : 6 2025-05-07T19:43:05.5172252Z model : 85 2025-05-07T19:43:05.5172409Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5172488Z stepping : 7 2025-05-07T19:43:05.5172590Z microcode : 0x5003901 2025-05-07T19:43:05.5172668Z cpu MHz : 2999.994 2025-05-07T19:43:05.5172757Z cache size : 36608 KB 2025-05-07T19:43:05.5172851Z physical id : 1 2025-05-07T19:43:05.5172935Z siblings : 48 2025-05-07T19:43:05.5173017Z core id : 17 2025-05-07T19:43:05.5173105Z cpu cores : 24 2025-05-07T19:43:05.5173199Z apicid : 98 2025-05-07T19:43:05.5173290Z initial apicid : 98 2025-05-07T19:43:05.5173376Z fpu : yes 2025-05-07T19:43:05.5173464Z fpu_exception : yes 2025-05-07T19:43:05.5173557Z cpuid level : 13 2025-05-07T19:43:05.5173639Z wp : yes 2025-05-07T19:43:05.5175713Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5176106Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5176195Z bogomips : 5999.98 2025-05-07T19:43:05.5176285Z clflush size : 64 2025-05-07T19:43:05.5176384Z cache_alignment : 64 2025-05-07T19:43:05.5176583Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5176681Z power management: 2025-05-07T19:43:05.5176756Z 2025-05-07T19:43:05.5176868Z processor : 42 2025-05-07T19:43:05.5176965Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5177048Z cpu family : 6 2025-05-07T19:43:05.5177158Z model : 85 2025-05-07T19:43:05.5177377Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5177462Z stepping : 7 2025-05-07T19:43:05.5177550Z microcode : 0x5003901 2025-05-07T19:43:05.5177661Z cpu MHz : 1199.723 2025-05-07T19:43:05.5177750Z cache size : 36608 KB 2025-05-07T19:43:05.5177837Z physical id : 1 2025-05-07T19:43:05.5177932Z siblings : 48 2025-05-07T19:43:05.5178043Z core id : 18 2025-05-07T19:43:05.5178123Z cpu cores : 24 2025-05-07T19:43:05.5178202Z apicid : 100 2025-05-07T19:43:05.5178307Z initial apicid : 100 2025-05-07T19:43:05.5178383Z fpu : yes 2025-05-07T19:43:05.5178467Z fpu_exception : yes 2025-05-07T19:43:05.5178550Z cpuid level : 13 2025-05-07T19:43:05.5178639Z wp : yes 2025-05-07T19:43:05.5180706Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5181100Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5181183Z bogomips : 5999.98 2025-05-07T19:43:05.5181265Z clflush size : 64 2025-05-07T19:43:05.5181351Z cache_alignment : 64 2025-05-07T19:43:05.5181499Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5181588Z power management: 2025-05-07T19:43:05.5181592Z 2025-05-07T19:43:05.5181678Z processor : 43 2025-05-07T19:43:05.5181783Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5181916Z cpu family : 6 2025-05-07T19:43:05.5181998Z model : 85 2025-05-07T19:43:05.5182160Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5182241Z stepping : 7 2025-05-07T19:43:05.5182334Z microcode : 0x5003901 2025-05-07T19:43:05.5182420Z cpu MHz : 2999.994 2025-05-07T19:43:05.5182514Z cache size : 36608 KB 2025-05-07T19:43:05.5182593Z physical id : 1 2025-05-07T19:43:05.5182670Z siblings : 48 2025-05-07T19:43:05.5182751Z core id : 19 2025-05-07T19:43:05.5182856Z cpu cores : 24 2025-05-07T19:43:05.5182937Z apicid : 102 2025-05-07T19:43:05.5183019Z initial apicid : 102 2025-05-07T19:43:05.5183115Z fpu : yes 2025-05-07T19:43:05.5183206Z fpu_exception : yes 2025-05-07T19:43:05.5183288Z cpuid level : 13 2025-05-07T19:43:05.5183365Z wp : yes 2025-05-07T19:43:05.5185450Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5185835Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5185933Z bogomips : 5999.98 2025-05-07T19:43:05.5186021Z clflush size : 64 2025-05-07T19:43:05.5186108Z cache_alignment : 64 2025-05-07T19:43:05.5186235Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5186334Z power management: 2025-05-07T19:43:05.5186339Z 2025-05-07T19:43:05.5186422Z processor : 44 2025-05-07T19:43:05.5186568Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5186661Z cpu family : 6 2025-05-07T19:43:05.5186740Z model : 85 2025-05-07T19:43:05.5186899Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5186982Z stepping : 7 2025-05-07T19:43:05.5187075Z microcode : 0x5003901 2025-05-07T19:43:05.5187160Z cpu MHz : 2999.994 2025-05-07T19:43:05.5187248Z cache size : 36608 KB 2025-05-07T19:43:05.5187352Z physical id : 1 2025-05-07T19:43:05.5187431Z siblings : 48 2025-05-07T19:43:05.5187520Z core id : 20 2025-05-07T19:43:05.5187607Z cpu cores : 24 2025-05-07T19:43:05.5187707Z apicid : 104 2025-05-07T19:43:05.5187792Z initial apicid : 104 2025-05-07T19:43:05.5187873Z fpu : yes 2025-05-07T19:43:05.5187975Z fpu_exception : yes 2025-05-07T19:43:05.5188058Z cpuid level : 13 2025-05-07T19:43:05.5188138Z wp : yes 2025-05-07T19:43:05.5190316Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5190665Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5190739Z bogomips : 5999.98 2025-05-07T19:43:05.5190835Z clflush size : 64 2025-05-07T19:43:05.5190913Z cache_alignment : 64 2025-05-07T19:43:05.5191030Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5191105Z power management: 2025-05-07T19:43:05.5191109Z 2025-05-07T19:43:05.5191194Z processor : 45 2025-05-07T19:43:05.5191273Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5191348Z cpu family : 6 2025-05-07T19:43:05.5191439Z model : 85 2025-05-07T19:43:05.5191636Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5191710Z stepping : 7 2025-05-07T19:43:05.5191789Z microcode : 0x5003901 2025-05-07T19:43:05.5191877Z cpu MHz : 2999.994 2025-05-07T19:43:05.5191956Z cache size : 36608 KB 2025-05-07T19:43:05.5192036Z physical id : 1 2025-05-07T19:43:05.5192122Z siblings : 48 2025-05-07T19:43:05.5192194Z core id : 21 2025-05-07T19:43:05.5192266Z cpu cores : 24 2025-05-07T19:43:05.5192338Z apicid : 106 2025-05-07T19:43:05.5192418Z initial apicid : 106 2025-05-07T19:43:05.5192487Z fpu : yes 2025-05-07T19:43:05.5192564Z fpu_exception : yes 2025-05-07T19:43:05.5192652Z cpuid level : 13 2025-05-07T19:43:05.5192721Z wp : yes 2025-05-07T19:43:05.5194631Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5194994Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5195072Z bogomips : 5999.98 2025-05-07T19:43:05.5195147Z clflush size : 64 2025-05-07T19:43:05.5195231Z cache_alignment : 64 2025-05-07T19:43:05.5195344Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5195420Z power management: 2025-05-07T19:43:05.5195425Z 2025-05-07T19:43:05.5195500Z processor : 46 2025-05-07T19:43:05.5195590Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5195664Z cpu family : 6 2025-05-07T19:43:05.5195733Z model : 85 2025-05-07T19:43:05.5195931Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5196006Z stepping : 7 2025-05-07T19:43:05.5196082Z microcode : 0x5003901 2025-05-07T19:43:05.5196158Z cpu MHz : 2999.994 2025-05-07T19:43:05.5196246Z cache size : 36608 KB 2025-05-07T19:43:05.5196321Z physical id : 1 2025-05-07T19:43:05.5196392Z siblings : 48 2025-05-07T19:43:05.5196468Z core id : 22 2025-05-07T19:43:05.5196541Z cpu cores : 24 2025-05-07T19:43:05.5196615Z apicid : 108 2025-05-07T19:43:05.5196692Z initial apicid : 108 2025-05-07T19:43:05.5196774Z fpu : yes 2025-05-07T19:43:05.5196851Z fpu_exception : yes 2025-05-07T19:43:05.5196922Z cpuid level : 13 2025-05-07T19:43:05.5196997Z wp : yes 2025-05-07T19:43:05.5198918Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5199270Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5199354Z bogomips : 5999.98 2025-05-07T19:43:05.5199425Z clflush size : 64 2025-05-07T19:43:05.5199500Z cache_alignment : 64 2025-05-07T19:43:05.5199619Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5199694Z power management: 2025-05-07T19:43:05.5199698Z 2025-05-07T19:43:05.5199767Z processor : 47 2025-05-07T19:43:05.5199844Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5199916Z cpu family : 6 2025-05-07T19:43:05.5199987Z model : 85 2025-05-07T19:43:05.5200131Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5200207Z stepping : 7 2025-05-07T19:43:05.5200323Z microcode : 0x5003901 2025-05-07T19:43:05.5200395Z cpu MHz : 2999.994 2025-05-07T19:43:05.5200472Z cache size : 36608 KB 2025-05-07T19:43:05.5200552Z physical id : 1 2025-05-07T19:43:05.5200620Z siblings : 48 2025-05-07T19:43:05.5200689Z core id : 23 2025-05-07T19:43:05.5200770Z cpu cores : 24 2025-05-07T19:43:05.5200838Z apicid : 110 2025-05-07T19:43:05.5200911Z initial apicid : 110 2025-05-07T19:43:05.5200977Z fpu : yes 2025-05-07T19:43:05.5201060Z fpu_exception : yes 2025-05-07T19:43:05.5201134Z cpuid level : 13 2025-05-07T19:43:05.5201199Z wp : yes 2025-05-07T19:43:05.5203108Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5203458Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5203535Z bogomips : 5999.98 2025-05-07T19:43:05.5203615Z clflush size : 64 2025-05-07T19:43:05.5203688Z cache_alignment : 64 2025-05-07T19:43:05.5203806Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5203894Z power management: 2025-05-07T19:43:05.5203898Z 2025-05-07T19:43:05.5203969Z processor : 48 2025-05-07T19:43:05.5204050Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5204120Z cpu family : 6 2025-05-07T19:43:05.5204195Z model : 85 2025-05-07T19:43:05.5204337Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5204450Z stepping : 7 2025-05-07T19:43:05.5204536Z microcode : 0x5003901 2025-05-07T19:43:05.5204613Z cpu MHz : 2999.994 2025-05-07T19:43:05.5204684Z cache size : 36608 KB 2025-05-07T19:43:05.5204755Z physical id : 0 2025-05-07T19:43:05.5204834Z siblings : 48 2025-05-07T19:43:05.5204904Z core id : 0 2025-05-07T19:43:05.5204978Z cpu cores : 24 2025-05-07T19:43:05.5205053Z apicid : 1 2025-05-07T19:43:05.5205127Z initial apicid : 1 2025-05-07T19:43:05.5205198Z fpu : yes 2025-05-07T19:43:05.5205275Z fpu_exception : yes 2025-05-07T19:43:05.5205358Z cpuid level : 13 2025-05-07T19:43:05.5205424Z wp : yes 2025-05-07T19:43:05.5207334Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5207692Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5207770Z bogomips : 5999.98 2025-05-07T19:43:05.5207844Z clflush size : 64 2025-05-07T19:43:05.5207924Z cache_alignment : 64 2025-05-07T19:43:05.5208034Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5208106Z power management: 2025-05-07T19:43:05.5208111Z 2025-05-07T19:43:05.5208194Z processor : 49 2025-05-07T19:43:05.5208277Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5208348Z cpu family : 6 2025-05-07T19:43:05.5208413Z model : 85 2025-05-07T19:43:05.5208564Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5208637Z stepping : 7 2025-05-07T19:43:05.5208718Z microcode : 0x5003901 2025-05-07T19:43:05.5208795Z cpu MHz : 2999.994 2025-05-07T19:43:05.5208921Z cache size : 36608 KB 2025-05-07T19:43:05.5208998Z physical id : 0 2025-05-07T19:43:05.5209069Z siblings : 48 2025-05-07T19:43:05.5209149Z core id : 1 2025-05-07T19:43:05.5209225Z cpu cores : 24 2025-05-07T19:43:05.5209294Z apicid : 3 2025-05-07T19:43:05.5209367Z initial apicid : 3 2025-05-07T19:43:05.5209439Z fpu : yes 2025-05-07T19:43:05.5209512Z fpu_exception : yes 2025-05-07T19:43:05.5209588Z cpuid level : 13 2025-05-07T19:43:05.5209661Z wp : yes 2025-05-07T19:43:05.5211556Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5211901Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5211983Z bogomips : 5999.98 2025-05-07T19:43:05.5212055Z clflush size : 64 2025-05-07T19:43:05.5212135Z cache_alignment : 64 2025-05-07T19:43:05.5212258Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5212333Z power management: 2025-05-07T19:43:05.5212337Z 2025-05-07T19:43:05.5212408Z processor : 50 2025-05-07T19:43:05.5212500Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5212572Z cpu family : 6 2025-05-07T19:43:05.5212644Z model : 85 2025-05-07T19:43:05.5212788Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5212871Z stepping : 7 2025-05-07T19:43:05.5212946Z microcode : 0x5003901 2025-05-07T19:43:05.5213066Z cpu MHz : 2999.994 2025-05-07T19:43:05.5213144Z cache size : 36608 KB 2025-05-07T19:43:05.5213230Z physical id : 0 2025-05-07T19:43:05.5213300Z siblings : 48 2025-05-07T19:43:05.5213373Z core id : 2 2025-05-07T19:43:05.5213452Z cpu cores : 24 2025-05-07T19:43:05.5213525Z apicid : 5 2025-05-07T19:43:05.5213601Z initial apicid : 5 2025-05-07T19:43:05.5213672Z fpu : yes 2025-05-07T19:43:05.5213754Z fpu_exception : yes 2025-05-07T19:43:05.5213830Z cpuid level : 13 2025-05-07T19:43:05.5213900Z wp : yes 2025-05-07T19:43:05.5215815Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5216161Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5216239Z bogomips : 5999.98 2025-05-07T19:43:05.5216325Z clflush size : 64 2025-05-07T19:43:05.5216401Z cache_alignment : 64 2025-05-07T19:43:05.5216588Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5216675Z power management: 2025-05-07T19:43:05.5216679Z 2025-05-07T19:43:05.5216751Z processor : 51 2025-05-07T19:43:05.5216833Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5217078Z cpu family : 6 2025-05-07T19:43:05.5217166Z model : 85 2025-05-07T19:43:05.5217320Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5217399Z stepping : 7 2025-05-07T19:43:05.5217492Z microcode : 0x5003901 2025-05-07T19:43:05.5217571Z cpu MHz : 3276.205 2025-05-07T19:43:05.5217654Z cache size : 36608 KB 2025-05-07T19:43:05.5217733Z physical id : 0 2025-05-07T19:43:05.5217887Z siblings : 48 2025-05-07T19:43:05.5217964Z core id : 3 2025-05-07T19:43:05.5218040Z cpu cores : 24 2025-05-07T19:43:05.5218125Z apicid : 7 2025-05-07T19:43:05.5218208Z initial apicid : 7 2025-05-07T19:43:05.5218284Z fpu : yes 2025-05-07T19:43:05.5218368Z fpu_exception : yes 2025-05-07T19:43:05.5218459Z cpuid level : 13 2025-05-07T19:43:05.5218537Z wp : yes 2025-05-07T19:43:05.5220610Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5221004Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5221080Z bogomips : 5999.98 2025-05-07T19:43:05.5221157Z clflush size : 64 2025-05-07T19:43:05.5221247Z cache_alignment : 64 2025-05-07T19:43:05.5221375Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5221453Z power management: 2025-05-07T19:43:05.5221458Z 2025-05-07T19:43:05.5221545Z processor : 52 2025-05-07T19:43:05.5221630Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5221708Z cpu family : 6 2025-05-07T19:43:05.5221779Z model : 85 2025-05-07T19:43:05.5221941Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5222019Z stepping : 7 2025-05-07T19:43:05.5222099Z microcode : 0x5003901 2025-05-07T19:43:05.5222187Z cpu MHz : 2999.994 2025-05-07T19:43:05.5222269Z cache size : 36608 KB 2025-05-07T19:43:05.5222397Z physical id : 0 2025-05-07T19:43:05.5222474Z siblings : 48 2025-05-07T19:43:05.5222560Z core id : 4 2025-05-07T19:43:05.5222640Z cpu cores : 24 2025-05-07T19:43:05.5222713Z apicid : 9 2025-05-07T19:43:05.5222802Z initial apicid : 9 2025-05-07T19:43:05.5222879Z fpu : yes 2025-05-07T19:43:05.5222962Z fpu_exception : yes 2025-05-07T19:43:05.5223039Z cpuid level : 13 2025-05-07T19:43:05.5223119Z wp : yes 2025-05-07T19:43:05.5225188Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5225570Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5225650Z bogomips : 5999.98 2025-05-07T19:43:05.5225730Z clflush size : 64 2025-05-07T19:43:05.5225814Z cache_alignment : 64 2025-05-07T19:43:05.5225945Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5226026Z power management: 2025-05-07T19:43:05.5226030Z 2025-05-07T19:43:05.5226107Z processor : 53 2025-05-07T19:43:05.5226207Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5226286Z cpu family : 6 2025-05-07T19:43:05.5226362Z model : 85 2025-05-07T19:43:05.5226519Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5226609Z stepping : 7 2025-05-07T19:43:05.5226691Z microcode : 0x5003901 2025-05-07T19:43:05.5226770Z cpu MHz : 3325.031 2025-05-07T19:43:05.5226867Z cache size : 36608 KB 2025-05-07T19:43:05.5226949Z physical id : 0 2025-05-07T19:43:05.5227025Z siblings : 48 2025-05-07T19:43:05.5227103Z core id : 5 2025-05-07T19:43:05.5227198Z cpu cores : 24 2025-05-07T19:43:05.5227334Z apicid : 11 2025-05-07T19:43:05.5227418Z initial apicid : 11 2025-05-07T19:43:05.5227505Z fpu : yes 2025-05-07T19:43:05.5227588Z fpu_exception : yes 2025-05-07T19:43:05.5227665Z cpuid level : 13 2025-05-07T19:43:05.5227740Z wp : yes 2025-05-07T19:43:05.5229873Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5230223Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5230315Z bogomips : 5999.98 2025-05-07T19:43:05.5230388Z clflush size : 64 2025-05-07T19:43:05.5230464Z cache_alignment : 64 2025-05-07T19:43:05.5230581Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5230672Z power management: 2025-05-07T19:43:05.5230676Z 2025-05-07T19:43:05.5230747Z processor : 54 2025-05-07T19:43:05.5230830Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5230913Z cpu family : 6 2025-05-07T19:43:05.5230985Z model : 85 2025-05-07T19:43:05.5231127Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5231198Z stepping : 7 2025-05-07T19:43:05.5231288Z microcode : 0x5003901 2025-05-07T19:43:05.5231362Z cpu MHz : 3338.205 2025-05-07T19:43:05.5231435Z cache size : 36608 KB 2025-05-07T19:43:05.5231526Z physical id : 0 2025-05-07T19:43:05.5231599Z siblings : 48 2025-05-07T19:43:05.5231669Z core id : 6 2025-05-07T19:43:05.5231788Z cpu cores : 24 2025-05-07T19:43:05.5231876Z apicid : 13 2025-05-07T19:43:05.5231959Z initial apicid : 13 2025-05-07T19:43:05.5232028Z fpu : yes 2025-05-07T19:43:05.5232106Z fpu_exception : yes 2025-05-07T19:43:05.5232189Z cpuid level : 13 2025-05-07T19:43:05.5232259Z wp : yes 2025-05-07T19:43:05.5234164Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5234526Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5234605Z bogomips : 5999.98 2025-05-07T19:43:05.5234679Z clflush size : 64 2025-05-07T19:43:05.5234763Z cache_alignment : 64 2025-05-07T19:43:05.5234881Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5234955Z power management: 2025-05-07T19:43:05.5234959Z 2025-05-07T19:43:05.5235038Z processor : 55 2025-05-07T19:43:05.5235120Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5235194Z cpu family : 6 2025-05-07T19:43:05.5235270Z model : 85 2025-05-07T19:43:05.5235411Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5235484Z stepping : 7 2025-05-07T19:43:05.5235562Z microcode : 0x5003901 2025-05-07T19:43:05.5235645Z cpu MHz : 2999.994 2025-05-07T19:43:05.5235722Z cache size : 36608 KB 2025-05-07T19:43:05.5235796Z physical id : 0 2025-05-07T19:43:05.5235875Z siblings : 48 2025-05-07T19:43:05.5235942Z core id : 7 2025-05-07T19:43:05.5236016Z cpu cores : 24 2025-05-07T19:43:05.5236090Z apicid : 15 2025-05-07T19:43:05.5236174Z initial apicid : 15 2025-05-07T19:43:05.5236290Z fpu : yes 2025-05-07T19:43:05.5236369Z fpu_exception : yes 2025-05-07T19:43:05.5236443Z cpuid level : 13 2025-05-07T19:43:05.5236519Z wp : yes 2025-05-07T19:43:05.5238425Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5238777Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5238852Z bogomips : 5999.98 2025-05-07T19:43:05.5238933Z clflush size : 64 2025-05-07T19:43:05.5239011Z cache_alignment : 64 2025-05-07T19:43:05.5239135Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5239213Z power management: 2025-05-07T19:43:05.5239218Z 2025-05-07T19:43:05.5239291Z processor : 56 2025-05-07T19:43:05.5239379Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5239452Z cpu family : 6 2025-05-07T19:43:05.5239522Z model : 85 2025-05-07T19:43:05.5239665Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5239743Z stepping : 7 2025-05-07T19:43:05.5239818Z microcode : 0x5003901 2025-05-07T19:43:05.5239890Z cpu MHz : 3301.069 2025-05-07T19:43:05.5239974Z cache size : 36608 KB 2025-05-07T19:43:05.5240047Z physical id : 0 2025-05-07T19:43:05.5240117Z siblings : 48 2025-05-07T19:43:05.5240187Z core id : 8 2025-05-07T19:43:05.5240266Z cpu cores : 24 2025-05-07T19:43:05.5240337Z apicid : 17 2025-05-07T19:43:05.5240473Z initial apicid : 17 2025-05-07T19:43:05.5240551Z fpu : yes 2025-05-07T19:43:05.5240630Z fpu_exception : yes 2025-05-07T19:43:05.5240706Z cpuid level : 13 2025-05-07T19:43:05.5240776Z wp : yes 2025-05-07T19:43:05.5242689Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5243031Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5243113Z bogomips : 5999.98 2025-05-07T19:43:05.5243191Z clflush size : 64 2025-05-07T19:43:05.5243271Z cache_alignment : 64 2025-05-07T19:43:05.5243394Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5243486Z power management: 2025-05-07T19:43:05.5243490Z 2025-05-07T19:43:05.5243564Z processor : 57 2025-05-07T19:43:05.5243645Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5243729Z cpu family : 6 2025-05-07T19:43:05.5243799Z model : 85 2025-05-07T19:43:05.5243939Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5244008Z stepping : 7 2025-05-07T19:43:05.5244092Z microcode : 0x5003901 2025-05-07T19:43:05.5244166Z cpu MHz : 2999.994 2025-05-07T19:43:05.5244242Z cache size : 36608 KB 2025-05-07T19:43:05.5244324Z physical id : 0 2025-05-07T19:43:05.5244397Z siblings : 48 2025-05-07T19:43:05.5244470Z core id : 9 2025-05-07T19:43:05.5244541Z cpu cores : 24 2025-05-07T19:43:05.5244622Z apicid : 19 2025-05-07T19:43:05.5244699Z initial apicid : 19 2025-05-07T19:43:05.5244769Z fpu : yes 2025-05-07T19:43:05.5244856Z fpu_exception : yes 2025-05-07T19:43:05.5244930Z cpuid level : 13 2025-05-07T19:43:05.5245049Z wp : yes 2025-05-07T19:43:05.5246964Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5247312Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5247388Z bogomips : 5999.98 2025-05-07T19:43:05.5247468Z clflush size : 64 2025-05-07T19:43:05.5247550Z cache_alignment : 64 2025-05-07T19:43:05.5247668Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5247748Z power management: 2025-05-07T19:43:05.5247752Z 2025-05-07T19:43:05.5247834Z processor : 58 2025-05-07T19:43:05.5247917Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5247990Z cpu family : 6 2025-05-07T19:43:05.5248070Z model : 85 2025-05-07T19:43:05.5248215Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5248288Z stepping : 7 2025-05-07T19:43:05.5248367Z microcode : 0x5003901 2025-05-07T19:43:05.5248448Z cpu MHz : 2999.994 2025-05-07T19:43:05.5248526Z cache size : 36608 KB 2025-05-07T19:43:05.5248599Z physical id : 0 2025-05-07T19:43:05.5248679Z siblings : 48 2025-05-07T19:43:05.5248748Z core id : 10 2025-05-07T19:43:05.5248821Z cpu cores : 24 2025-05-07T19:43:05.5248892Z apicid : 21 2025-05-07T19:43:05.5248977Z initial apicid : 21 2025-05-07T19:43:05.5249044Z fpu : yes 2025-05-07T19:43:05.5249122Z fpu_exception : yes 2025-05-07T19:43:05.5249251Z cpuid level : 13 2025-05-07T19:43:05.5249323Z wp : yes 2025-05-07T19:43:05.5251234Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5251592Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5251667Z bogomips : 5999.98 2025-05-07T19:43:05.5251742Z clflush size : 64 2025-05-07T19:43:05.5251824Z cache_alignment : 64 2025-05-07T19:43:05.5251947Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5252024Z power management: 2025-05-07T19:43:05.5252029Z 2025-05-07T19:43:05.5252099Z processor : 59 2025-05-07T19:43:05.5252192Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5252266Z cpu family : 6 2025-05-07T19:43:05.5252335Z model : 85 2025-05-07T19:43:05.5252490Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5252562Z stepping : 7 2025-05-07T19:43:05.5252637Z microcode : 0x5003901 2025-05-07T19:43:05.5252711Z cpu MHz : 3260.143 2025-05-07T19:43:05.5252801Z cache size : 36608 KB 2025-05-07T19:43:05.5252876Z physical id : 0 2025-05-07T19:43:05.5252946Z siblings : 48 2025-05-07T19:43:05.5253025Z core id : 11 2025-05-07T19:43:05.5253098Z cpu cores : 24 2025-05-07T19:43:05.5253168Z apicid : 23 2025-05-07T19:43:05.5253244Z initial apicid : 23 2025-05-07T19:43:05.5253325Z fpu : yes 2025-05-07T19:43:05.5253402Z fpu_exception : yes 2025-05-07T19:43:05.5253476Z cpuid level : 13 2025-05-07T19:43:05.5253548Z wp : yes 2025-05-07T19:43:05.5255467Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5255859Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5255942Z bogomips : 5999.98 2025-05-07T19:43:05.5256015Z clflush size : 64 2025-05-07T19:43:05.5256090Z cache_alignment : 64 2025-05-07T19:43:05.5256219Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5256296Z power management: 2025-05-07T19:43:05.5256304Z 2025-05-07T19:43:05.5256379Z processor : 60 2025-05-07T19:43:05.5256522Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5256606Z cpu family : 6 2025-05-07T19:43:05.5256675Z model : 85 2025-05-07T19:43:05.5256819Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5257073Z stepping : 7 2025-05-07T19:43:05.5257156Z microcode : 0x5003901 2025-05-07T19:43:05.5257237Z cpu MHz : 2999.994 2025-05-07T19:43:05.5257317Z cache size : 36608 KB 2025-05-07T19:43:05.5257406Z physical id : 0 2025-05-07T19:43:05.5257483Z siblings : 48 2025-05-07T19:43:05.5257560Z core id : 12 2025-05-07T19:43:05.5257657Z cpu cores : 24 2025-05-07T19:43:05.5257816Z apicid : 25 2025-05-07T19:43:05.5257905Z initial apicid : 25 2025-05-07T19:43:05.5257984Z fpu : yes 2025-05-07T19:43:05.5258086Z fpu_exception : yes 2025-05-07T19:43:05.5258170Z cpuid level : 13 2025-05-07T19:43:05.5258250Z wp : yes 2025-05-07T19:43:05.5260381Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5260765Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5260851Z bogomips : 5999.98 2025-05-07T19:43:05.5260951Z clflush size : 64 2025-05-07T19:43:05.5261040Z cache_alignment : 64 2025-05-07T19:43:05.5261171Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5261275Z power management: 2025-05-07T19:43:05.5261280Z 2025-05-07T19:43:05.5261364Z processor : 61 2025-05-07T19:43:05.5261461Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5261545Z cpu family : 6 2025-05-07T19:43:05.5261641Z model : 85 2025-05-07T19:43:05.5261800Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5261883Z stepping : 7 2025-05-07T19:43:05.5261988Z microcode : 0x5003901 2025-05-07T19:43:05.5262073Z cpu MHz : 2999.994 2025-05-07T19:43:05.5262159Z cache size : 36608 KB 2025-05-07T19:43:05.5262244Z physical id : 0 2025-05-07T19:43:05.5262343Z siblings : 48 2025-05-07T19:43:05.5262425Z core id : 13 2025-05-07T19:43:05.5262509Z cpu cores : 24 2025-05-07T19:43:05.5262590Z apicid : 27 2025-05-07T19:43:05.5262692Z initial apicid : 27 2025-05-07T19:43:05.5262775Z fpu : yes 2025-05-07T19:43:05.5262863Z fpu_exception : yes 2025-05-07T19:43:05.5262964Z cpuid level : 13 2025-05-07T19:43:05.5263044Z wp : yes 2025-05-07T19:43:05.5265112Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5265556Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5265646Z bogomips : 5999.98 2025-05-07T19:43:05.5265733Z clflush size : 64 2025-05-07T19:43:05.5265836Z cache_alignment : 64 2025-05-07T19:43:05.5265969Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5266057Z power management: 2025-05-07T19:43:05.5266061Z 2025-05-07T19:43:05.5266161Z processor : 62 2025-05-07T19:43:05.5266254Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5266341Z cpu family : 6 2025-05-07T19:43:05.5266422Z model : 85 2025-05-07T19:43:05.5266594Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5266678Z stepping : 7 2025-05-07T19:43:05.5266768Z microcode : 0x5003901 2025-05-07T19:43:05.5266867Z cpu MHz : 3313.442 2025-05-07T19:43:05.5266955Z cache size : 36608 KB 2025-05-07T19:43:05.5267045Z physical id : 0 2025-05-07T19:43:05.5267126Z siblings : 48 2025-05-07T19:43:05.5267215Z core id : 14 2025-05-07T19:43:05.5267295Z cpu cores : 24 2025-05-07T19:43:05.5267375Z apicid : 29 2025-05-07T19:43:05.5267459Z initial apicid : 29 2025-05-07T19:43:05.5267551Z fpu : yes 2025-05-07T19:43:05.5267637Z fpu_exception : yes 2025-05-07T19:43:05.5267719Z cpuid level : 13 2025-05-07T19:43:05.5267809Z wp : yes 2025-05-07T19:43:05.5269929Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5270474Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5270572Z bogomips : 5999.98 2025-05-07T19:43:05.5270656Z clflush size : 64 2025-05-07T19:43:05.5270740Z cache_alignment : 64 2025-05-07T19:43:05.5270882Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5270968Z power management: 2025-05-07T19:43:05.5270972Z 2025-05-07T19:43:05.5271053Z processor : 63 2025-05-07T19:43:05.5271161Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5271243Z cpu family : 6 2025-05-07T19:43:05.5271327Z model : 85 2025-05-07T19:43:05.5271484Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5271580Z stepping : 7 2025-05-07T19:43:05.5271666Z microcode : 0x5003901 2025-05-07T19:43:05.5271748Z cpu MHz : 3273.006 2025-05-07T19:43:05.5271835Z cache size : 36608 KB 2025-05-07T19:43:05.5271939Z physical id : 0 2025-05-07T19:43:05.5272020Z siblings : 48 2025-05-07T19:43:05.5272101Z core id : 15 2025-05-07T19:43:05.5272197Z cpu cores : 24 2025-05-07T19:43:05.5272277Z apicid : 31 2025-05-07T19:43:05.5272363Z initial apicid : 31 2025-05-07T19:43:05.5272443Z fpu : yes 2025-05-07T19:43:05.5272540Z fpu_exception : yes 2025-05-07T19:43:05.5272622Z cpuid level : 13 2025-05-07T19:43:05.5272700Z wp : yes 2025-05-07T19:43:05.5274793Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5275259Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5275343Z bogomips : 5999.98 2025-05-07T19:43:05.5275440Z clflush size : 64 2025-05-07T19:43:05.5275526Z cache_alignment : 64 2025-05-07T19:43:05.5275657Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5275755Z power management: 2025-05-07T19:43:05.5275759Z 2025-05-07T19:43:05.5275841Z processor : 64 2025-05-07T19:43:05.5275934Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5276019Z cpu family : 6 2025-05-07T19:43:05.5276113Z model : 85 2025-05-07T19:43:05.5276276Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5276358Z stepping : 7 2025-05-07T19:43:05.5276456Z microcode : 0x5003901 2025-05-07T19:43:05.5276541Z cpu MHz : 3320.707 2025-05-07T19:43:05.5276627Z cache size : 36608 KB 2025-05-07T19:43:05.5276711Z physical id : 0 2025-05-07T19:43:05.5276804Z siblings : 48 2025-05-07T19:43:05.5276889Z core id : 16 2025-05-07T19:43:05.5276972Z cpu cores : 24 2025-05-07T19:43:05.5277066Z apicid : 33 2025-05-07T19:43:05.5277153Z initial apicid : 33 2025-05-07T19:43:05.5277231Z fpu : yes 2025-05-07T19:43:05.5277318Z fpu_exception : yes 2025-05-07T19:43:05.5277414Z cpuid level : 13 2025-05-07T19:43:05.5277495Z wp : yes 2025-05-07T19:43:05.5279654Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5280051Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5280137Z bogomips : 5999.98 2025-05-07T19:43:05.5280225Z clflush size : 64 2025-05-07T19:43:05.5280328Z cache_alignment : 64 2025-05-07T19:43:05.5280459Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5280547Z power management: 2025-05-07T19:43:05.5280552Z 2025-05-07T19:43:05.5280650Z processor : 65 2025-05-07T19:43:05.5280746Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5280828Z cpu family : 6 2025-05-07T19:43:05.5280910Z model : 85 2025-05-07T19:43:05.5281083Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5281170Z stepping : 7 2025-05-07T19:43:05.5281259Z microcode : 0x5003901 2025-05-07T19:43:05.5281356Z cpu MHz : 3319.615 2025-05-07T19:43:05.5281442Z cache size : 36608 KB 2025-05-07T19:43:05.5281526Z physical id : 0 2025-05-07T19:43:05.5281606Z siblings : 48 2025-05-07T19:43:05.5281698Z core id : 17 2025-05-07T19:43:05.5281780Z cpu cores : 24 2025-05-07T19:43:05.5281859Z apicid : 35 2025-05-07T19:43:05.5281957Z initial apicid : 35 2025-05-07T19:43:05.5282145Z fpu : yes 2025-05-07T19:43:05.5282227Z fpu_exception : yes 2025-05-07T19:43:05.5282304Z cpuid level : 13 2025-05-07T19:43:05.5282392Z wp : yes 2025-05-07T19:43:05.5284305Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5284713Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5284791Z bogomips : 5999.98 2025-05-07T19:43:05.5284869Z clflush size : 64 2025-05-07T19:43:05.5284949Z cache_alignment : 64 2025-05-07T19:43:05.5285083Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5285163Z power management: 2025-05-07T19:43:05.5285167Z 2025-05-07T19:43:05.5285244Z processor : 66 2025-05-07T19:43:05.5285340Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5285415Z cpu family : 6 2025-05-07T19:43:05.5285488Z model : 85 2025-05-07T19:43:05.5285637Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5285728Z stepping : 7 2025-05-07T19:43:05.5285807Z microcode : 0x5003901 2025-05-07T19:43:05.5285883Z cpu MHz : 3346.520 2025-05-07T19:43:05.5285974Z cache size : 36608 KB 2025-05-07T19:43:05.5286051Z physical id : 0 2025-05-07T19:43:05.5286125Z siblings : 48 2025-05-07T19:43:05.5286197Z core id : 18 2025-05-07T19:43:05.5286285Z cpu cores : 24 2025-05-07T19:43:05.5286358Z apicid : 37 2025-05-07T19:43:05.5286438Z initial apicid : 37 2025-05-07T19:43:05.5286523Z fpu : yes 2025-05-07T19:43:05.5286609Z fpu_exception : yes 2025-05-07T19:43:05.5286686Z cpuid level : 13 2025-05-07T19:43:05.5286758Z wp : yes 2025-05-07T19:43:05.5288727Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5289081Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5289173Z bogomips : 5999.98 2025-05-07T19:43:05.5289251Z clflush size : 64 2025-05-07T19:43:05.5289332Z cache_alignment : 64 2025-05-07T19:43:05.5289453Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5289545Z power management: 2025-05-07T19:43:05.5289549Z 2025-05-07T19:43:05.5289626Z processor : 67 2025-05-07T19:43:05.5289711Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5289800Z cpu family : 6 2025-05-07T19:43:05.5289875Z model : 85 2025-05-07T19:43:05.5290027Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5290105Z stepping : 7 2025-05-07T19:43:05.5290201Z microcode : 0x5003901 2025-05-07T19:43:05.5290281Z cpu MHz : 3339.248 2025-05-07T19:43:05.5290359Z cache size : 36608 KB 2025-05-07T19:43:05.5290453Z physical id : 0 2025-05-07T19:43:05.5290530Z siblings : 48 2025-05-07T19:43:05.5290604Z core id : 19 2025-05-07T19:43:05.5290680Z cpu cores : 24 2025-05-07T19:43:05.5290767Z apicid : 39 2025-05-07T19:43:05.5290846Z initial apicid : 39 2025-05-07T19:43:05.5290921Z fpu : yes 2025-05-07T19:43:05.5291001Z fpu_exception : yes 2025-05-07T19:43:05.5291090Z cpuid level : 13 2025-05-07T19:43:05.5291164Z wp : yes 2025-05-07T19:43:05.5293073Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5293482Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5293560Z bogomips : 5999.98 2025-05-07T19:43:05.5293652Z clflush size : 64 2025-05-07T19:43:05.5293733Z cache_alignment : 64 2025-05-07T19:43:05.5293855Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5293934Z power management: 2025-05-07T19:43:05.5293938Z 2025-05-07T19:43:05.5294026Z processor : 68 2025-05-07T19:43:05.5294111Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5294187Z cpu family : 6 2025-05-07T19:43:05.5294274Z model : 85 2025-05-07T19:43:05.5294423Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5294499Z stepping : 7 2025-05-07T19:43:05.5294583Z microcode : 0x5003901 2025-05-07T19:43:05.5294675Z cpu MHz : 3257.908 2025-05-07T19:43:05.5294760Z cache size : 36608 KB 2025-05-07T19:43:05.5294838Z physical id : 0 2025-05-07T19:43:05.5294931Z siblings : 48 2025-05-07T19:43:05.5295007Z core id : 20 2025-05-07T19:43:05.5295085Z cpu cores : 24 2025-05-07T19:43:05.5295162Z apicid : 41 2025-05-07T19:43:05.5295256Z initial apicid : 41 2025-05-07T19:43:05.5295331Z fpu : yes 2025-05-07T19:43:05.5295413Z fpu_exception : yes 2025-05-07T19:43:05.5295492Z cpuid level : 13 2025-05-07T19:43:05.5295582Z wp : yes 2025-05-07T19:43:05.5297946Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5298346Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5298431Z bogomips : 5999.98 2025-05-07T19:43:05.5298517Z clflush size : 64 2025-05-07T19:43:05.5298606Z cache_alignment : 64 2025-05-07T19:43:05.5298754Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5298843Z power management: 2025-05-07T19:43:05.5298847Z 2025-05-07T19:43:05.5298931Z processor : 69 2025-05-07T19:43:05.5299040Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5299124Z cpu family : 6 2025-05-07T19:43:05.5299205Z model : 85 2025-05-07T19:43:05.5299380Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5299464Z stepping : 7 2025-05-07T19:43:05.5299551Z microcode : 0x5003901 2025-05-07T19:43:05.5299638Z cpu MHz : 3323.634 2025-05-07T19:43:05.5299740Z cache size : 36608 KB 2025-05-07T19:43:05.5299829Z physical id : 0 2025-05-07T19:43:05.5299910Z siblings : 48 2025-05-07T19:43:05.5299990Z core id : 21 2025-05-07T19:43:05.5300085Z cpu cores : 24 2025-05-07T19:43:05.5300167Z apicid : 43 2025-05-07T19:43:05.5300254Z initial apicid : 43 2025-05-07T19:43:05.5300345Z fpu : yes 2025-05-07T19:43:05.5300433Z fpu_exception : yes 2025-05-07T19:43:05.5300516Z cpuid level : 13 2025-05-07T19:43:05.5300595Z wp : yes 2025-05-07T19:43:05.5302687Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5303119Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5303219Z bogomips : 5999.98 2025-05-07T19:43:05.5303305Z clflush size : 64 2025-05-07T19:43:05.5303393Z cache_alignment : 64 2025-05-07T19:43:05.5303525Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5303625Z power management: 2025-05-07T19:43:05.5303630Z 2025-05-07T19:43:05.5303710Z processor : 70 2025-05-07T19:43:05.5303800Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5303899Z cpu family : 6 2025-05-07T19:43:05.5303977Z model : 85 2025-05-07T19:43:05.5304136Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5304217Z stepping : 7 2025-05-07T19:43:05.5304315Z microcode : 0x5003901 2025-05-07T19:43:05.5304396Z cpu MHz : 3468.513 2025-05-07T19:43:05.5304483Z cache size : 36608 KB 2025-05-07T19:43:05.5304577Z physical id : 0 2025-05-07T19:43:05.5304660Z siblings : 48 2025-05-07T19:43:05.5304740Z core id : 22 2025-05-07T19:43:05.5304821Z cpu cores : 24 2025-05-07T19:43:05.5304912Z apicid : 45 2025-05-07T19:43:05.5305002Z initial apicid : 45 2025-05-07T19:43:05.5305081Z fpu : yes 2025-05-07T19:43:05.5305179Z fpu_exception : yes 2025-05-07T19:43:05.5305260Z cpuid level : 13 2025-05-07T19:43:05.5305338Z wp : yes 2025-05-07T19:43:05.5307459Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5307839Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5307923Z bogomips : 5999.98 2025-05-07T19:43:05.5308022Z clflush size : 64 2025-05-07T19:43:05.5308109Z cache_alignment : 64 2025-05-07T19:43:05.5308240Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5308326Z power management: 2025-05-07T19:43:05.5308330Z 2025-05-07T19:43:05.5308429Z processor : 71 2025-05-07T19:43:05.5308523Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5308606Z cpu family : 6 2025-05-07T19:43:05.5322102Z model : 85 2025-05-07T19:43:05.5322337Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5322416Z stepping : 7 2025-05-07T19:43:05.5322503Z microcode : 0x5003901 2025-05-07T19:43:05.5322576Z cpu MHz : 3363.173 2025-05-07T19:43:05.5322653Z cache size : 36608 KB 2025-05-07T19:43:05.5322738Z physical id : 0 2025-05-07T19:43:05.5322835Z siblings : 48 2025-05-07T19:43:05.5322912Z core id : 23 2025-05-07T19:43:05.5322985Z cpu cores : 24 2025-05-07T19:43:05.5323065Z apicid : 47 2025-05-07T19:43:05.5323152Z initial apicid : 47 2025-05-07T19:43:05.5323222Z fpu : yes 2025-05-07T19:43:05.5323301Z fpu_exception : yes 2025-05-07T19:43:05.5323380Z cpuid level : 13 2025-05-07T19:43:05.5323450Z wp : yes 2025-05-07T19:43:05.5325374Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5325732Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5325936Z bogomips : 5999.98 2025-05-07T19:43:05.5326011Z clflush size : 64 2025-05-07T19:43:05.5326101Z cache_alignment : 64 2025-05-07T19:43:05.5326222Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5326300Z power management: 2025-05-07T19:43:05.5326306Z 2025-05-07T19:43:05.5326381Z processor : 72 2025-05-07T19:43:05.5326474Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5326549Z cpu family : 6 2025-05-07T19:43:05.5326619Z model : 85 2025-05-07T19:43:05.5326773Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5326846Z stepping : 7 2025-05-07T19:43:05.5326921Z microcode : 0x5003901 2025-05-07T19:43:05.5326991Z cpu MHz : 2999.994 2025-05-07T19:43:05.5327072Z cache size : 36608 KB 2025-05-07T19:43:05.5327144Z physical id : 1 2025-05-07T19:43:05.5327217Z siblings : 48 2025-05-07T19:43:05.5327295Z core id : 0 2025-05-07T19:43:05.5327367Z cpu cores : 24 2025-05-07T19:43:05.5327438Z apicid : 65 2025-05-07T19:43:05.5327512Z initial apicid : 65 2025-05-07T19:43:05.5327592Z fpu : yes 2025-05-07T19:43:05.5327668Z fpu_exception : yes 2025-05-07T19:43:05.5327739Z cpuid level : 13 2025-05-07T19:43:05.5327818Z wp : yes 2025-05-07T19:43:05.5329726Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5330121Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5330208Z bogomips : 5999.98 2025-05-07T19:43:05.5330283Z clflush size : 64 2025-05-07T19:43:05.5330358Z cache_alignment : 64 2025-05-07T19:43:05.5330481Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5330554Z power management: 2025-05-07T19:43:05.5330559Z 2025-05-07T19:43:05.5330631Z processor : 73 2025-05-07T19:43:05.5330713Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5330788Z cpu family : 6 2025-05-07T19:43:05.5330856Z model : 85 2025-05-07T19:43:05.5331000Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5331081Z stepping : 7 2025-05-07T19:43:05.5331156Z microcode : 0x5003901 2025-05-07T19:43:05.5331229Z cpu MHz : 2999.994 2025-05-07T19:43:05.5331303Z cache size : 36608 KB 2025-05-07T19:43:05.5331383Z physical id : 1 2025-05-07T19:43:05.5331453Z siblings : 48 2025-05-07T19:43:05.5331525Z core id : 1 2025-05-07T19:43:05.5331603Z cpu cores : 24 2025-05-07T19:43:05.5331675Z apicid : 67 2025-05-07T19:43:05.5331749Z initial apicid : 67 2025-05-07T19:43:05.5331817Z fpu : yes 2025-05-07T19:43:05.5331900Z fpu_exception : yes 2025-05-07T19:43:05.5331970Z cpuid level : 13 2025-05-07T19:43:05.5332037Z wp : yes 2025-05-07T19:43:05.5333946Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5334290Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5334414Z bogomips : 5999.98 2025-05-07T19:43:05.5334491Z clflush size : 64 2025-05-07T19:43:05.5334569Z cache_alignment : 64 2025-05-07T19:43:05.5334684Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5334765Z power management: 2025-05-07T19:43:05.5334770Z 2025-05-07T19:43:05.5334842Z processor : 74 2025-05-07T19:43:05.5334921Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5334992Z cpu family : 6 2025-05-07T19:43:05.5335067Z model : 85 2025-05-07T19:43:05.5335211Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5335284Z stepping : 7 2025-05-07T19:43:05.5335364Z microcode : 0x5003901 2025-05-07T19:43:05.5335436Z cpu MHz : 2999.994 2025-05-07T19:43:05.5335509Z cache size : 36608 KB 2025-05-07T19:43:05.5335581Z physical id : 1 2025-05-07T19:43:05.5335658Z siblings : 48 2025-05-07T19:43:05.5335726Z core id : 2 2025-05-07T19:43:05.5335798Z cpu cores : 24 2025-05-07T19:43:05.5335874Z apicid : 69 2025-05-07T19:43:05.5335947Z initial apicid : 69 2025-05-07T19:43:05.5336017Z fpu : yes 2025-05-07T19:43:05.5336089Z fpu_exception : yes 2025-05-07T19:43:05.5336170Z cpuid level : 13 2025-05-07T19:43:05.5336238Z wp : yes 2025-05-07T19:43:05.5338516Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5338957Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5339041Z bogomips : 5999.98 2025-05-07T19:43:05.5339127Z clflush size : 64 2025-05-07T19:43:05.5339218Z cache_alignment : 64 2025-05-07T19:43:05.5339345Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5339428Z power management: 2025-05-07T19:43:05.5339433Z 2025-05-07T19:43:05.5339521Z processor : 75 2025-05-07T19:43:05.5339606Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5339684Z cpu family : 6 2025-05-07T19:43:05.5339757Z model : 85 2025-05-07T19:43:05.5339920Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5339995Z stepping : 7 2025-05-07T19:43:05.5340075Z microcode : 0x5003901 2025-05-07T19:43:05.5340158Z cpu MHz : 1199.649 2025-05-07T19:43:05.5340240Z cache size : 36608 KB 2025-05-07T19:43:05.5340317Z physical id : 1 2025-05-07T19:43:05.5340390Z siblings : 48 2025-05-07T19:43:05.5340468Z core id : 3 2025-05-07T19:43:05.5340543Z cpu cores : 24 2025-05-07T19:43:05.5340617Z apicid : 71 2025-05-07T19:43:05.5340707Z initial apicid : 71 2025-05-07T19:43:05.5340780Z fpu : yes 2025-05-07T19:43:05.5340863Z fpu_exception : yes 2025-05-07T19:43:05.5340942Z cpuid level : 13 2025-05-07T19:43:05.5341021Z wp : yes 2025-05-07T19:43:05.5343079Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5343451Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5343541Z bogomips : 5999.98 2025-05-07T19:43:05.5343620Z clflush size : 64 2025-05-07T19:43:05.5343753Z cache_alignment : 64 2025-05-07T19:43:05.5343890Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5343970Z power management: 2025-05-07T19:43:05.5343975Z 2025-05-07T19:43:05.5344052Z processor : 76 2025-05-07T19:43:05.5344139Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5344211Z cpu family : 6 2025-05-07T19:43:05.5344281Z model : 85 2025-05-07T19:43:05.5344430Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5344512Z stepping : 7 2025-05-07T19:43:05.5344591Z microcode : 0x5003901 2025-05-07T19:43:05.5344667Z cpu MHz : 2999.994 2025-05-07T19:43:05.5344756Z cache size : 36608 KB 2025-05-07T19:43:05.5344836Z physical id : 1 2025-05-07T19:43:05.5344909Z siblings : 48 2025-05-07T19:43:05.5344985Z core id : 4 2025-05-07T19:43:05.5345072Z cpu cores : 24 2025-05-07T19:43:05.5345145Z apicid : 73 2025-05-07T19:43:05.5345226Z initial apicid : 73 2025-05-07T19:43:05.5345302Z fpu : yes 2025-05-07T19:43:05.5345387Z fpu_exception : yes 2025-05-07T19:43:05.5345468Z cpuid level : 13 2025-05-07T19:43:05.5345539Z wp : yes 2025-05-07T19:43:05.5347602Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5347976Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5348053Z bogomips : 5999.98 2025-05-07T19:43:05.5348187Z clflush size : 64 2025-05-07T19:43:05.5348268Z cache_alignment : 64 2025-05-07T19:43:05.5348395Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5348481Z power management: 2025-05-07T19:43:05.5348486Z 2025-05-07T19:43:05.5348561Z processor : 77 2025-05-07T19:43:05.5348645Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5348726Z cpu family : 6 2025-05-07T19:43:05.5348797Z model : 85 2025-05-07T19:43:05.5348950Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5349128Z stepping : 7 2025-05-07T19:43:05.5349210Z microcode : 0x5003901 2025-05-07T19:43:05.5349279Z cpu MHz : 2999.994 2025-05-07T19:43:05.5349354Z cache size : 36608 KB 2025-05-07T19:43:05.5349424Z physical id : 1 2025-05-07T19:43:05.5349499Z siblings : 48 2025-05-07T19:43:05.5349565Z core id : 5 2025-05-07T19:43:05.5349633Z cpu cores : 24 2025-05-07T19:43:05.5349709Z apicid : 75 2025-05-07T19:43:05.5349783Z initial apicid : 75 2025-05-07T19:43:05.5349850Z fpu : yes 2025-05-07T19:43:05.5349926Z fpu_exception : yes 2025-05-07T19:43:05.5350006Z cpuid level : 13 2025-05-07T19:43:05.5350077Z wp : yes 2025-05-07T19:43:05.5351995Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5352345Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5352420Z bogomips : 5999.98 2025-05-07T19:43:05.5352491Z clflush size : 64 2025-05-07T19:43:05.5352573Z cache_alignment : 64 2025-05-07T19:43:05.5352688Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5352812Z power management: 2025-05-07T19:43:05.5352816Z 2025-05-07T19:43:05.5352892Z processor : 78 2025-05-07T19:43:05.5352969Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5353041Z cpu family : 6 2025-05-07T19:43:05.5353108Z model : 85 2025-05-07T19:43:05.5353251Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5353321Z stepping : 7 2025-05-07T19:43:05.5353395Z microcode : 0x5003901 2025-05-07T19:43:05.5353469Z cpu MHz : 2506.294 2025-05-07T19:43:05.5353546Z cache size : 36608 KB 2025-05-07T19:43:05.5353619Z physical id : 1 2025-05-07T19:43:05.5353688Z siblings : 48 2025-05-07T19:43:05.5353762Z core id : 6 2025-05-07T19:43:05.5353830Z cpu cores : 24 2025-05-07T19:43:05.5353897Z apicid : 77 2025-05-07T19:43:05.5353976Z initial apicid : 77 2025-05-07T19:43:05.5354044Z fpu : yes 2025-05-07T19:43:05.5354119Z fpu_exception : yes 2025-05-07T19:43:05.5354192Z cpuid level : 13 2025-05-07T19:43:05.5354264Z wp : yes 2025-05-07T19:43:05.5356172Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5356522Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5356594Z bogomips : 5999.98 2025-05-07T19:43:05.5356667Z clflush size : 64 2025-05-07T19:43:05.5356741Z cache_alignment : 64 2025-05-07T19:43:05.5356903Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5356977Z power management: 2025-05-07T19:43:05.5356985Z 2025-05-07T19:43:05.5357055Z processor : 79 2025-05-07T19:43:05.5357138Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5357210Z cpu family : 6 2025-05-07T19:43:05.5357276Z model : 85 2025-05-07T19:43:05.5357414Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5357490Z stepping : 7 2025-05-07T19:43:05.5357564Z microcode : 0x5003901 2025-05-07T19:43:05.5357634Z cpu MHz : 2999.994 2025-05-07T19:43:05.5357712Z cache size : 36608 KB 2025-05-07T19:43:05.5357781Z physical id : 1 2025-05-07T19:43:05.5357848Z siblings : 48 2025-05-07T19:43:05.5357917Z core id : 7 2025-05-07T19:43:05.5357995Z cpu cores : 24 2025-05-07T19:43:05.5358064Z apicid : 79 2025-05-07T19:43:05.5358137Z initial apicid : 79 2025-05-07T19:43:05.5358213Z fpu : yes 2025-05-07T19:43:05.5358285Z fpu_exception : yes 2025-05-07T19:43:05.5358355Z cpuid level : 13 2025-05-07T19:43:05.5358426Z wp : yes 2025-05-07T19:43:05.5360345Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5360690Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5360767Z bogomips : 5999.98 2025-05-07T19:43:05.5360836Z clflush size : 64 2025-05-07T19:43:05.5360911Z cache_alignment : 64 2025-05-07T19:43:05.5361026Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5361107Z power management: 2025-05-07T19:43:05.5361111Z 2025-05-07T19:43:05.5361243Z processor : 80 2025-05-07T19:43:05.5361322Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5361397Z cpu family : 6 2025-05-07T19:43:05.5361464Z model : 85 2025-05-07T19:43:05.5361603Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5361672Z stepping : 7 2025-05-07T19:43:05.5361753Z microcode : 0x5003901 2025-05-07T19:43:05.5361822Z cpu MHz : 2999.994 2025-05-07T19:43:05.5361894Z cache size : 36608 KB 2025-05-07T19:43:05.5361971Z physical id : 1 2025-05-07T19:43:05.5362040Z siblings : 48 2025-05-07T19:43:05.5362108Z core id : 8 2025-05-07T19:43:05.5362176Z cpu cores : 24 2025-05-07T19:43:05.5362249Z apicid : 81 2025-05-07T19:43:05.5362323Z initial apicid : 81 2025-05-07T19:43:05.5362390Z fpu : yes 2025-05-07T19:43:05.5362471Z fpu_exception : yes 2025-05-07T19:43:05.5362541Z cpuid level : 13 2025-05-07T19:43:05.5362606Z wp : yes 2025-05-07T19:43:05.5364517Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5364866Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5364938Z bogomips : 5999.98 2025-05-07T19:43:05.5365014Z clflush size : 64 2025-05-07T19:43:05.5365089Z cache_alignment : 64 2025-05-07T19:43:05.5365203Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5365274Z power management: 2025-05-07T19:43:05.5365325Z 2025-05-07T19:43:05.5365401Z processor : 81 2025-05-07T19:43:05.5365483Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5365551Z cpu family : 6 2025-05-07T19:43:05.5365624Z model : 85 2025-05-07T19:43:05.5365761Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5365831Z stepping : 7 2025-05-07T19:43:05.5365904Z microcode : 0x5003901 2025-05-07T19:43:05.5365980Z cpu MHz : 1200.190 2025-05-07T19:43:05.5366053Z cache size : 36608 KB 2025-05-07T19:43:05.5366124Z physical id : 1 2025-05-07T19:43:05.5366197Z siblings : 48 2025-05-07T19:43:05.5366264Z core id : 9 2025-05-07T19:43:05.5366335Z cpu cores : 24 2025-05-07T19:43:05.5366401Z apicid : 83 2025-05-07T19:43:05.5366484Z initial apicid : 83 2025-05-07T19:43:05.5366550Z fpu : yes 2025-05-07T19:43:05.5366624Z fpu_exception : yes 2025-05-07T19:43:05.5366691Z cpuid level : 13 2025-05-07T19:43:05.5366765Z wp : yes 2025-05-07T19:43:05.5368666Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5369019Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5369092Z bogomips : 5999.98 2025-05-07T19:43:05.5369165Z clflush size : 64 2025-05-07T19:43:05.5369244Z cache_alignment : 64 2025-05-07T19:43:05.5369358Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5369431Z power management: 2025-05-07T19:43:05.5369436Z 2025-05-07T19:43:05.5369510Z processor : 82 2025-05-07T19:43:05.5369595Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5369712Z cpu family : 6 2025-05-07T19:43:05.5369777Z model : 85 2025-05-07T19:43:05.5369926Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5369995Z stepping : 7 2025-05-07T19:43:05.5370067Z microcode : 0x5003901 2025-05-07T19:43:05.5370274Z cpu MHz : 2999.994 2025-05-07T19:43:05.5370352Z cache size : 36608 KB 2025-05-07T19:43:05.5370424Z physical id : 1 2025-05-07T19:43:05.5370657Z siblings : 48 2025-05-07T19:43:05.5370738Z core id : 10 2025-05-07T19:43:05.5370815Z cpu cores : 24 2025-05-07T19:43:05.5370889Z apicid : 85 2025-05-07T19:43:05.5370968Z initial apicid : 85 2025-05-07T19:43:05.5371051Z fpu : yes 2025-05-07T19:43:05.5371130Z fpu_exception : yes 2025-05-07T19:43:05.5371219Z cpuid level : 13 2025-05-07T19:43:05.5371293Z wp : yes 2025-05-07T19:43:05.5373365Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5373747Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5373826Z bogomips : 5999.98 2025-05-07T19:43:05.5373904Z clflush size : 64 2025-05-07T19:43:05.5373993Z cache_alignment : 64 2025-05-07T19:43:05.5374119Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5374198Z power management: 2025-05-07T19:43:05.5374203Z 2025-05-07T19:43:05.5374286Z processor : 83 2025-05-07T19:43:05.5374473Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5374549Z cpu family : 6 2025-05-07T19:43:05.5374626Z model : 85 2025-05-07T19:43:05.5374785Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5374862Z stepping : 7 2025-05-07T19:43:05.5374941Z microcode : 0x5003901 2025-05-07T19:43:05.5375024Z cpu MHz : 1200.317 2025-05-07T19:43:05.5375104Z cache size : 36608 KB 2025-05-07T19:43:05.5375182Z physical id : 1 2025-05-07T19:43:05.5375256Z siblings : 48 2025-05-07T19:43:05.5375337Z core id : 11 2025-05-07T19:43:05.5375412Z cpu cores : 24 2025-05-07T19:43:05.5375486Z apicid : 87 2025-05-07T19:43:05.5375574Z initial apicid : 87 2025-05-07T19:43:05.5375649Z fpu : yes 2025-05-07T19:43:05.5375730Z fpu_exception : yes 2025-05-07T19:43:05.5375807Z cpuid level : 13 2025-05-07T19:43:05.5375887Z wp : yes 2025-05-07T19:43:05.5378009Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5378391Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5378478Z bogomips : 5999.98 2025-05-07T19:43:05.5378556Z clflush size : 64 2025-05-07T19:43:05.5378637Z cache_alignment : 64 2025-05-07T19:43:05.5378771Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5378854Z power management: 2025-05-07T19:43:05.5378858Z 2025-05-07T19:43:05.5378935Z processor : 84 2025-05-07T19:43:05.5379028Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5379110Z cpu family : 6 2025-05-07T19:43:05.5379183Z model : 85 2025-05-07T19:43:05.5379334Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5379498Z stepping : 7 2025-05-07T19:43:05.5379578Z microcode : 0x5003901 2025-05-07T19:43:05.5379655Z cpu MHz : 1199.642 2025-05-07T19:43:05.5379739Z cache size : 36608 KB 2025-05-07T19:43:05.5379819Z physical id : 1 2025-05-07T19:43:05.5379894Z siblings : 48 2025-05-07T19:43:05.5379968Z core id : 12 2025-05-07T19:43:05.5380053Z cpu cores : 24 2025-05-07T19:43:05.5380128Z apicid : 89 2025-05-07T19:43:05.5380209Z initial apicid : 89 2025-05-07T19:43:05.5380291Z fpu : yes 2025-05-07T19:43:05.5380373Z fpu_exception : yes 2025-05-07T19:43:05.5380450Z cpuid level : 13 2025-05-07T19:43:05.5380523Z wp : yes 2025-05-07T19:43:05.5382603Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5382978Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5383064Z bogomips : 5999.98 2025-05-07T19:43:05.5383140Z clflush size : 64 2025-05-07T19:43:05.5383219Z cache_alignment : 64 2025-05-07T19:43:05.5383343Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5383431Z power management: 2025-05-07T19:43:05.5383435Z 2025-05-07T19:43:05.5383510Z processor : 85 2025-05-07T19:43:05.5383596Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5383675Z cpu family : 6 2025-05-07T19:43:05.5383747Z model : 85 2025-05-07T19:43:05.5383947Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5384025Z stepping : 7 2025-05-07T19:43:05.5384109Z microcode : 0x5003901 2025-05-07T19:43:05.5384183Z cpu MHz : 1199.791 2025-05-07T19:43:05.5384260Z cache size : 36608 KB 2025-05-07T19:43:05.5384339Z physical id : 1 2025-05-07T19:43:05.5384412Z siblings : 48 2025-05-07T19:43:05.5384484Z core id : 13 2025-05-07T19:43:05.5384560Z cpu cores : 24 2025-05-07T19:43:05.5384640Z apicid : 91 2025-05-07T19:43:05.5384718Z initial apicid : 91 2025-05-07T19:43:05.5384793Z fpu : yes 2025-05-07T19:43:05.5384885Z fpu_exception : yes 2025-05-07T19:43:05.5384964Z cpuid level : 13 2025-05-07T19:43:05.5385039Z wp : yes 2025-05-07T19:43:05.5387104Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5387485Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5387566Z bogomips : 5999.98 2025-05-07T19:43:05.5387649Z clflush size : 64 2025-05-07T19:43:05.5387725Z cache_alignment : 64 2025-05-07T19:43:05.5387851Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5387929Z power management: 2025-05-07T19:43:05.5387933Z 2025-05-07T19:43:05.5388015Z processor : 86 2025-05-07T19:43:05.5388099Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5388173Z cpu family : 6 2025-05-07T19:43:05.5388252Z model : 85 2025-05-07T19:43:05.5388408Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5388484Z stepping : 7 2025-05-07T19:43:05.5388645Z microcode : 0x5003901 2025-05-07T19:43:05.5388839Z cpu MHz : 1263.906 2025-05-07T19:43:05.5388915Z cache size : 36608 KB 2025-05-07T19:43:05.5388987Z physical id : 1 2025-05-07T19:43:05.5389064Z siblings : 48 2025-05-07T19:43:05.5389132Z core id : 14 2025-05-07T19:43:05.5389206Z cpu cores : 24 2025-05-07T19:43:05.5389278Z apicid : 93 2025-05-07T19:43:05.5389364Z initial apicid : 93 2025-05-07T19:43:05.5389436Z fpu : yes 2025-05-07T19:43:05.5389511Z fpu_exception : yes 2025-05-07T19:43:05.5389583Z cpuid level : 13 2025-05-07T19:43:05.5389659Z wp : yes 2025-05-07T19:43:05.5391581Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5391936Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5392012Z bogomips : 5999.98 2025-05-07T19:43:05.5392087Z clflush size : 64 2025-05-07T19:43:05.5392167Z cache_alignment : 64 2025-05-07T19:43:05.5392282Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5392357Z power management: 2025-05-07T19:43:05.5392361Z 2025-05-07T19:43:05.5392431Z processor : 87 2025-05-07T19:43:05.5392517Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5392588Z cpu family : 6 2025-05-07T19:43:05.5392654Z model : 85 2025-05-07T19:43:05.5392809Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5392939Z stepping : 7 2025-05-07T19:43:05.5393016Z microcode : 0x5003901 2025-05-07T19:43:05.5393091Z cpu MHz : 1200.477 2025-05-07T19:43:05.5393173Z cache size : 36608 KB 2025-05-07T19:43:05.5393243Z physical id : 1 2025-05-07T19:43:05.5393312Z siblings : 48 2025-05-07T19:43:05.5393390Z core id : 15 2025-05-07T19:43:05.5393460Z cpu cores : 24 2025-05-07T19:43:05.5393531Z apicid : 95 2025-05-07T19:43:05.5393605Z initial apicid : 95 2025-05-07T19:43:05.5393685Z fpu : yes 2025-05-07T19:43:05.5393764Z fpu_exception : yes 2025-05-07T19:43:05.5393836Z cpuid level : 13 2025-05-07T19:43:05.5393904Z wp : yes 2025-05-07T19:43:05.5395818Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5396167Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5396246Z bogomips : 5999.98 2025-05-07T19:43:05.5396319Z clflush size : 64 2025-05-07T19:43:05.5396392Z cache_alignment : 64 2025-05-07T19:43:05.5396508Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5396589Z power management: 2025-05-07T19:43:05.5396593Z 2025-05-07T19:43:05.5396667Z processor : 88 2025-05-07T19:43:05.5396745Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5396826Z cpu family : 6 2025-05-07T19:43:05.5396896Z model : 85 2025-05-07T19:43:05.5397036Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5397117Z stepping : 7 2025-05-07T19:43:05.5397198Z microcode : 0x5003901 2025-05-07T19:43:05.5397271Z cpu MHz : 1199.157 2025-05-07T19:43:05.5397391Z cache size : 36608 KB 2025-05-07T19:43:05.5397475Z physical id : 1 2025-05-07T19:43:05.5397549Z siblings : 48 2025-05-07T19:43:05.5397616Z core id : 16 2025-05-07T19:43:05.5397685Z cpu cores : 24 2025-05-07T19:43:05.5397761Z apicid : 97 2025-05-07T19:43:05.5397837Z initial apicid : 97 2025-05-07T19:43:05.5397904Z fpu : yes 2025-05-07T19:43:05.5397991Z fpu_exception : yes 2025-05-07T19:43:05.5398062Z cpuid level : 13 2025-05-07T19:43:05.5398128Z wp : yes 2025-05-07T19:43:05.5400038Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5400385Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5400459Z bogomips : 5999.98 2025-05-07T19:43:05.5400540Z clflush size : 64 2025-05-07T19:43:05.5400615Z cache_alignment : 64 2025-05-07T19:43:05.5400729Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5400803Z power management: 2025-05-07T19:43:05.5400814Z 2025-05-07T19:43:05.5400885Z processor : 89 2025-05-07T19:43:05.5400964Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5401033Z cpu family : 6 2025-05-07T19:43:05.5401108Z model : 85 2025-05-07T19:43:05.5401246Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5401317Z stepping : 7 2025-05-07T19:43:05.5401389Z microcode : 0x5003901 2025-05-07T19:43:05.5401514Z cpu MHz : 1199.601 2025-05-07T19:43:05.5401591Z cache size : 36608 KB 2025-05-07T19:43:05.5401663Z physical id : 1 2025-05-07T19:43:05.5401739Z siblings : 48 2025-05-07T19:43:05.5401805Z core id : 17 2025-05-07T19:43:05.5401874Z cpu cores : 24 2025-05-07T19:43:05.5401941Z apicid : 99 2025-05-07T19:43:05.5402021Z initial apicid : 99 2025-05-07T19:43:05.5402091Z fpu : yes 2025-05-07T19:43:05.5402164Z fpu_exception : yes 2025-05-07T19:43:05.5402241Z cpuid level : 13 2025-05-07T19:43:05.5402307Z wp : yes 2025-05-07T19:43:05.5404206Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5404560Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5404634Z bogomips : 5999.98 2025-05-07T19:43:05.5404705Z clflush size : 64 2025-05-07T19:43:05.5404787Z cache_alignment : 64 2025-05-07T19:43:05.5404899Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5404974Z power management: 2025-05-07T19:43:05.5404978Z 2025-05-07T19:43:05.5405049Z processor : 90 2025-05-07T19:43:05.5405136Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5405208Z cpu family : 6 2025-05-07T19:43:05.5405275Z model : 85 2025-05-07T19:43:05.5405422Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5405494Z stepping : 7 2025-05-07T19:43:05.5405568Z microcode : 0x5003901 2025-05-07T19:43:05.5405638Z cpu MHz : 2999.994 2025-05-07T19:43:05.5405722Z cache size : 36608 KB 2025-05-07T19:43:05.5405798Z physical id : 1 2025-05-07T19:43:05.5406248Z siblings : 48 2025-05-07T19:43:05.5406324Z core id : 18 2025-05-07T19:43:05.5406394Z cpu cores : 24 2025-05-07T19:43:05.5406466Z apicid : 101 2025-05-07T19:43:05.5406541Z initial apicid : 101 2025-05-07T19:43:05.5406621Z fpu : yes 2025-05-07T19:43:05.5406697Z fpu_exception : yes 2025-05-07T19:43:05.5406770Z cpuid level : 13 2025-05-07T19:43:05.5406844Z wp : yes 2025-05-07T19:43:05.5408757Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5409101Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5409180Z bogomips : 5999.98 2025-05-07T19:43:05.5409251Z clflush size : 64 2025-05-07T19:43:05.5409326Z cache_alignment : 64 2025-05-07T19:43:05.5409447Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5409519Z power management: 2025-05-07T19:43:05.5409523Z 2025-05-07T19:43:05.5409595Z processor : 91 2025-05-07T19:43:05.5409673Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5409747Z cpu family : 6 2025-05-07T19:43:05.5409814Z model : 85 2025-05-07T19:43:05.5409955Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5410029Z stepping : 7 2025-05-07T19:43:05.5410102Z microcode : 0x5003901 2025-05-07T19:43:05.5410173Z cpu MHz : 1200.080 2025-05-07T19:43:05.5410244Z cache size : 36608 KB 2025-05-07T19:43:05.5410372Z physical id : 1 2025-05-07T19:43:05.5410444Z siblings : 48 2025-05-07T19:43:05.5410515Z core id : 19 2025-05-07T19:43:05.5410593Z cpu cores : 24 2025-05-07T19:43:05.5410665Z apicid : 103 2025-05-07T19:43:05.5410741Z initial apicid : 103 2025-05-07T19:43:05.5410808Z fpu : yes 2025-05-07T19:43:05.5410890Z fpu_exception : yes 2025-05-07T19:43:05.5410965Z cpuid level : 13 2025-05-07T19:43:05.5411032Z wp : yes 2025-05-07T19:43:05.5412937Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5413284Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5413363Z bogomips : 5999.98 2025-05-07T19:43:05.5413441Z clflush size : 64 2025-05-07T19:43:05.5413517Z cache_alignment : 64 2025-05-07T19:43:05.5413632Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5413710Z power management: 2025-05-07T19:43:05.5413714Z 2025-05-07T19:43:05.5413786Z processor : 92 2025-05-07T19:43:05.5413865Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5413933Z cpu family : 6 2025-05-07T19:43:05.5414008Z model : 85 2025-05-07T19:43:05.5414149Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5414218Z stepping : 7 2025-05-07T19:43:05.5414299Z microcode : 0x5003901 2025-05-07T19:43:05.5414369Z cpu MHz : 2999.994 2025-05-07T19:43:05.5414441Z cache size : 36608 KB 2025-05-07T19:43:05.5414510Z physical id : 1 2025-05-07T19:43:05.5414587Z siblings : 48 2025-05-07T19:43:05.5414656Z core id : 20 2025-05-07T19:43:05.5414723Z cpu cores : 24 2025-05-07T19:43:05.5414845Z apicid : 105 2025-05-07T19:43:05.5414920Z initial apicid : 105 2025-05-07T19:43:05.5414988Z fpu : yes 2025-05-07T19:43:05.5415061Z fpu_exception : yes 2025-05-07T19:43:05.5415137Z cpuid level : 13 2025-05-07T19:43:05.5415203Z wp : yes 2025-05-07T19:43:05.5417386Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5417767Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5417848Z bogomips : 5999.98 2025-05-07T19:43:05.5417924Z clflush size : 64 2025-05-07T19:43:05.5418011Z cache_alignment : 64 2025-05-07T19:43:05.5418134Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5418213Z power management: 2025-05-07T19:43:05.5418218Z 2025-05-07T19:43:05.5418298Z processor : 93 2025-05-07T19:43:05.5418383Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5418456Z cpu family : 6 2025-05-07T19:43:05.5418526Z model : 85 2025-05-07T19:43:05.5418683Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5418756Z stepping : 7 2025-05-07T19:43:05.5418844Z microcode : 0x5003901 2025-05-07T19:43:05.5418945Z cpu MHz : 2673.179 2025-05-07T19:43:05.5419032Z cache size : 36608 KB 2025-05-07T19:43:05.5419117Z physical id : 1 2025-05-07T19:43:05.5419197Z siblings : 48 2025-05-07T19:43:05.5419293Z core id : 21 2025-05-07T19:43:05.5419429Z cpu cores : 24 2025-05-07T19:43:05.5419512Z apicid : 107 2025-05-07T19:43:05.5419618Z initial apicid : 107 2025-05-07T19:43:05.5419698Z fpu : yes 2025-05-07T19:43:05.5419782Z fpu_exception : yes 2025-05-07T19:43:05.5419867Z cpuid level : 13 2025-05-07T19:43:05.5419961Z wp : yes 2025-05-07T19:43:05.5422034Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5422415Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5422514Z bogomips : 5999.98 2025-05-07T19:43:05.5422599Z clflush size : 64 2025-05-07T19:43:05.5422683Z cache_alignment : 64 2025-05-07T19:43:05.5422824Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5422906Z power management: 2025-05-07T19:43:05.5422911Z 2025-05-07T19:43:05.5422989Z processor : 94 2025-05-07T19:43:05.5423088Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5423167Z cpu family : 6 2025-05-07T19:43:05.5423242Z model : 85 2025-05-07T19:43:05.5423400Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5423491Z stepping : 7 2025-05-07T19:43:05.5423572Z microcode : 0x5003901 2025-05-07T19:43:05.5423651Z cpu MHz : 2700.471 2025-05-07T19:43:05.5423746Z cache size : 36608 KB 2025-05-07T19:43:05.5423828Z physical id : 1 2025-05-07T19:43:05.5423904Z siblings : 48 2025-05-07T19:43:05.5423979Z core id : 22 2025-05-07T19:43:05.5424069Z cpu cores : 24 2025-05-07T19:43:05.5424148Z apicid : 109 2025-05-07T19:43:05.5424233Z initial apicid : 109 2025-05-07T19:43:05.5424357Z fpu : yes 2025-05-07T19:43:05.5424452Z fpu_exception : yes 2025-05-07T19:43:05.5424527Z cpuid level : 13 2025-05-07T19:43:05.5424601Z wp : yes 2025-05-07T19:43:05.5426676Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5427054Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5427136Z bogomips : 5999.98 2025-05-07T19:43:05.5427232Z clflush size : 64 2025-05-07T19:43:05.5427315Z cache_alignment : 64 2025-05-07T19:43:05.5427440Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5427536Z power management: 2025-05-07T19:43:05.5427541Z 2025-05-07T19:43:05.5427619Z processor : 95 2025-05-07T19:43:05.5427708Z vendor_id : GenuineIntel 2025-05-07T19:43:05.5427791Z cpu family : 6 2025-05-07T19:43:05.5427865Z model : 85 2025-05-07T19:43:05.5428022Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.5428101Z stepping : 7 2025-05-07T19:43:05.5428189Z microcode : 0x5003901 2025-05-07T19:43:05.5428267Z cpu MHz : 1412.476 2025-05-07T19:43:05.5428345Z cache size : 36608 KB 2025-05-07T19:43:05.5428425Z physical id : 1 2025-05-07T19:43:05.5428506Z siblings : 48 2025-05-07T19:43:05.5428583Z core id : 23 2025-05-07T19:43:05.5428660Z cpu cores : 24 2025-05-07T19:43:05.5428782Z apicid : 111 2025-05-07T19:43:05.5428935Z initial apicid : 111 2025-05-07T19:43:05.5429118Z fpu : yes 2025-05-07T19:43:05.5429200Z fpu_exception : yes 2025-05-07T19:43:05.5429279Z cpuid level : 13 2025-05-07T19:43:05.5429349Z wp : yes 2025-05-07T19:43:05.5431249Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.5431603Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.5431683Z bogomips : 5999.98 2025-05-07T19:43:05.5431755Z clflush size : 64 2025-05-07T19:43:05.5431844Z cache_alignment : 64 2025-05-07T19:43:05.5431961Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.5432035Z power management: 2025-05-07T19:43:05.5432039Z 2025-05-07T19:43:05.5432043Z 2025-05-07T19:43:05.5432158Z ################################################################################ 2025-05-07T19:43:05.5432245Z [INFO] Print PCI info ... 2025-05-07T19:43:05.5432314Z + lspci -v 2025-05-07T19:43:05.5432319Z 2025-05-07T19:43:05.5432495Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:05.5432591Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:05.5432698Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:05.5432702Z 2025-05-07T19:43:05.5432894Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:05.5432969Z Physical Slot: 1 2025-05-07T19:43:05.5433071Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.5433075Z 2025-05-07T19:43:05.5433302Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:05.5433435Z Physical Slot: 1 2025-05-07T19:43:05.5433548Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:05.5433553Z 2025-05-07T19:43:05.5433793Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:05.5433875Z Physical Slot: 3 2025-05-07T19:43:05.5433976Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.5434096Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:05.5434217Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:05.5434221Z 2025-05-07T19:43:05.5434501Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:05.5434595Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:05.5434682Z Physical Slot: 4 2025-05-07T19:43:05.5434802Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:05.5434951Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:05.5435047Z Capabilities: 2025-05-07T19:43:05.5435140Z Kernel driver in use: nvme 2025-05-07T19:43:05.5435144Z 2025-05-07T19:43:05.5435341Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:05.5435419Z Physical Slot: 5 2025-05-07T19:43:05.5435530Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.5435663Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:05.5435784Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:05.5435922Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:05.5436011Z Capabilities: 2025-05-07T19:43:05.5436093Z Kernel driver in use: ena 2025-05-07T19:43:05.5436097Z 2025-05-07T19:43:05.5436101Z 2025-05-07T19:43:05.5436252Z ################################################################################ 2025-05-07T19:43:05.5436365Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:05.5436441Z + uname -a 2025-05-07T19:43:05.5436446Z 2025-05-07T19:43:05.5436796Z Linux 2b31f69c500b 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:05.5436801Z 2025-05-07T19:43:05.5436881Z + uname -m 2025-05-07T19:43:05.5436886Z 2025-05-07T19:43:05.5436952Z x86_64 2025-05-07T19:43:05.5436956Z 2025-05-07T19:43:05.5437031Z + cat /proc/version 2025-05-07T19:43:05.5437037Z 2025-05-07T19:43:05.5437584Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:05.5437590Z 2025-05-07T19:43:05.5437666Z + cat /etc/os-release 2025-05-07T19:43:05.5437670Z 2025-05-07T19:43:05.5437745Z NAME="Amazon Linux" 2025-05-07T19:43:05.5437829Z VERSION="2023" 2025-05-07T19:43:05.5437902Z ID="amzn" 2025-05-07T19:43:05.5437977Z ID_LIKE="fedora" 2025-05-07T19:43:05.5438057Z VERSION_ID="2023" 2025-05-07T19:43:05.5438145Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:05.5438242Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:05.5438314Z ANSI_COLOR="0;33" 2025-05-07T19:43:05.5438424Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:05.5438588Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:05.5438735Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:05.5438884Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:05.5439056Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:05.5439128Z VENDOR_NAME="AWS" 2025-05-07T19:43:05.5439228Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:05.5439306Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:05.5439310Z 2025-05-07T19:43:05.5470521Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:05.5470684Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:05.5470969Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:05.5471147Z env: 2025-05-07T19:43:05.5471255Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:05.5471346Z BUILD_ENV: build_binary 2025-05-07T19:43:05.5471429Z BUILD_TARGET: default 2025-05-07T19:43:05.5471509Z BUILD_VARIANT: cuda 2025-05-07T19:43:05.5471601Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:05.5471677Z ##[endgroup] 2025-05-07T19:43:05.9454760Z ################################################################################ 2025-05-07T19:43:05.9455904Z [INFO] Printing general display info ... 2025-05-07T19:43:05.9480007Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:06.0373334Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:06.0379469Z /usr/bin/sudo 2025-05-07T19:43:06.0389535Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.0395651Z /usr/bin/yum 2025-05-07T19:43:06.0396373Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:06.0420677Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:06.2593067Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:48 2025. 2025-05-07T19:43:06.3547091Z Dependencies resolved. 2025-05-07T19:43:06.3761230Z Nothing to do. 2025-05-07T19:43:06.3761966Z Complete! 2025-05-07T19:43:06.4436527Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:06.4461988Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:06.6597745Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:48 2025. 2025-05-07T19:43:06.7117029Z Dependencies resolved. 
2025-05-07T19:43:06.7280838Z ================================================================================ 2025-05-07T19:43:06.7282272Z Package Arch Version Repository Size 2025-05-07T19:43:06.7283490Z ================================================================================ 2025-05-07T19:43:06.7283971Z Installing: 2025-05-07T19:43:06.7284353Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:06.7284834Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:06.7285153Z 2025-05-07T19:43:06.7285258Z Transaction Summary 2025-05-07T19:43:06.7285555Z ================================================================================ 2025-05-07T19:43:06.7285887Z Install 2 Packages 2025-05-07T19:43:06.7286038Z 2025-05-07T19:43:06.7286180Z Total download size: 347 k 2025-05-07T19:43:06.7286466Z Installed size: 883 k 2025-05-07T19:43:06.7286755Z Downloading Packages: 2025-05-07T19:43:06.8357066Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.7 MB/s | 28 kB 00:00 2025-05-07T19:43:06.8467120Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:43:06.8472539Z -------------------------------------------------------------------------------- 2025-05-07T19:43:06.8475309Z Total 2.9 MB/s | 347 kB 00:00 2025-05-07T19:43:06.8681848Z Running transaction check 2025-05-07T19:43:06.8732872Z Transaction check succeeded. 2025-05-07T19:43:06.8733795Z Running transaction test 2025-05-07T19:43:06.8886481Z Transaction test succeeded. 
2025-05-07T19:43:06.8887366Z Running transaction 2025-05-07T19:43:06.9164851Z Preparing : 1/1 2025-05-07T19:43:06.9243529Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:06.9279787Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.9687372Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.9689620Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:08.0053613Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:08.0054566Z 2025-05-07T19:43:08.0054974Z Installed: 2025-05-07T19:43:08.0055334Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:08.0055694Z 2025-05-07T19:43:08.0055791Z Complete! 2025-05-07T19:43:08.0416299Z + hostname 2025-05-07T19:43:08.0416734Z 2025-05-07T19:43:08.0428692Z 2b31f69c500b 2025-05-07T19:43:08.0430177Z 2025-05-07T19:43:08.0430734Z + sudo lshw -C display 2025-05-07T19:43:08.0431294Z 2025-05-07T19:43:08.2420067Z *-display UNCLAIMED 2025-05-07T19:43:08.2420456Z description: VGA compatible controller 2025-05-07T19:43:08.2420815Z product: Amazon.com, Inc. 2025-05-07T19:43:08.2421151Z vendor: Amazon.com, Inc. 2025-05-07T19:43:08.2421434Z physical id: 3 2025-05-07T19:43:08.2421723Z bus info: pci@0000:00:03.0 2025-05-07T19:43:08.2422009Z version: 00 2025-05-07T19:43:08.2422298Z width: 32 bits 2025-05-07T19:43:08.2422546Z clock: 33MHz 2025-05-07T19:43:08.2422849Z capabilities: vga_controller bus_master 2025-05-07T19:43:08.2423240Z configuration: latency=0 2025-05-07T19:43:08.2423595Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:08.2444980Z 2025-05-07T19:43:08.2445179Z ################################################################################ 2025-05-07T19:43:08.2445653Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:08.2582691Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:08.2613988Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:08.2614540Z [CHECK] nvidia-smi not found 2025-05-07T19:43:08.2614872Z ################################################################################ 2025-05-07T19:43:08.2615254Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:08.2738261Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:08.2765991Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:08.2766524Z [CHECK] rocminfo not found 2025-05-07T19:43:08.2777414Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:08.2778063Z [CHECK] rocm-smi not found 2025-05-07T19:43:08.2855884Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:08.2856379Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:08.2857219Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:08.2857639Z env: 2025-05-07T19:43:08.2857885Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:08.2858230Z BUILD_ENV: build_binary 2025-05-07T19:43:08.2858494Z BUILD_TARGET: default 2025-05-07T19:43:08.2858769Z BUILD_VARIANT: cuda 2025-05-07T19:43:08.2859052Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:08.2859317Z ##[endgroup] 2025-05-07T19:43:08.7421119Z ################################################################################ 2025-05-07T19:43:08.7421545Z # Setup Miniconda 2025-05-07T19:43:08.7421783Z # 2025-05-07T19:43:08.7441088Z # [2025-05-07T19:43:08.743Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:08.7441574Z ################################################################################ 2025-05-07T19:43:08.7441923Z 2025-05-07T19:43:08.7459546Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:08.8304980Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:08.8305623Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:08.8305864Z 2025-05-07T19:43:08.8326378Z 2025-05-07T19:43:08.8327029Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:08.8351231Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:09.9015454Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:09.9015975Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:09.9016279Z 2025-05-07T19:43:09.9166830Z PREFIX=/github/home/miniconda 2025-05-07T19:43:10.2723281Z Unpacking payload ... 2025-05-07T19:43:10.7539844Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:11.4352014Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:13.3027213Z 2025-05-07T19:43:13.3027928Z Installing base environment... 2025-05-07T19:43:13.3028545Z 2025-05-07T19:43:14.3764296Z Preparing transaction: ...working... done 2025-05-07T19:43:17.4049744Z Executing transaction: ...working... done 2025-05-07T19:43:17.9599028Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:18.0303834Z installation finished. 2025-05-07T19:43:18.0315055Z 2025-05-07T19:43:18.0315586Z + rm -f miniconda.sh 2025-05-07T19:43:18.0315807Z 2025-05-07T19:43:18.0515754Z 2025-05-07T19:43:18.0516181Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:18.0516614Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:18.0516855Z 2025-05-07T19:43:18.4219920Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:18.4221112Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:18.4222146Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:18.4223207Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:18.4224230Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:18.4225403Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:18.4226485Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:18.4226930Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:18.4227383Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:18.4227929Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:18.4228924Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:18.4229292Z modified /github/home/.bashrc 2025-05-07T19:43:18.4229497Z 2025-05-07T19:43:18.4229706Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:18.4229998Z 2025-05-07T19:43:18.4766290Z 2025-05-07T19:43:18.4766665Z + . /github/home/.bashrc 2025-05-07T19:43:18.4766862Z 2025-05-07T19:43:19.2703278Z 2025-05-07T19:43:19.2704471Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:19.2732324Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:30.8422327Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:32.2946215Z Solving environment: - \ | / - \ | / - \ | done 2025-05-07T19:43:32.3845776Z 2025-05-07T19:43:32.3846193Z ## Package Plan ## 2025-05-07T19:43:32.3846653Z 2025-05-07T19:43:32.3847046Z environment location: /github/home/miniconda 2025-05-07T19:43:32.3847761Z 2025-05-07T19:43:32.3848031Z added / updated specs: 2025-05-07T19:43:32.3848787Z - conda-libmamba-solver 2025-05-07T19:43:32.3849509Z - libarchive 2025-05-07T19:43:32.3850110Z - libmamba 2025-05-07T19:43:32.3850671Z - libmambapy 2025-05-07T19:43:32.3851036Z 2025-05-07T19:43:32.3851071Z 2025-05-07T19:43:32.3851412Z The following packages will be downloaded: 2025-05-07T19:43:32.3852047Z 2025-05-07T19:43:32.3853001Z package | build 2025-05-07T19:43:32.3853947Z ---------------------------|----------------- 2025-05-07T19:43:32.3854935Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:32.3855419Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:32.3855871Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:32.3856348Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:32.3857136Z ------------------------------------------------------------ 2025-05-07T19:43:32.3857568Z Total: 1.4 MB 2025-05-07T19:43:32.3857791Z 2025-05-07T19:43:32.3857909Z The following packages will be UPDATED: 2025-05-07T19:43:32.3858122Z 2025-05-07T19:43:32.3863406Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:32.3864267Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:32.3864691Z 2025-05-07T19:43:32.3864921Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:32.3865257Z 2025-05-07T19:43:32.3865607Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:32.3866441Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:32.3866968Z 2025-05-07T19:43:32.3866973Z 2025-05-07T19:43:32.3866977Z 2025-05-07T19:43:32.3867127Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:32.3867524Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:32.3867761Z 2025-05-07T19:43:32.3868206Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:32.3868472Z 2025-05-07T19:43:32.3868480Z 2025-05-07T19:43:32.3868722Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:32.3868984Z 2025-05-07T19:43:32.3868988Z 2025-05-07T19:43:32.3869254Z 2025-05-07T19:43:32.4372200Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:32.4373100Z 2025-05-07T19:43:32.4373115Z 2025-05-07T19:43:32.4373125Z 2025-05-07T19:43:32.4514551Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.4514884Z 2025-05-07T19:43:32.4515237Z 2025-05-07T19:43:32.4515267Z 2025-05-07T19:43:32.4587732Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.4639653Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.4640188Z 2025-05-07T19:43:32.4734766Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.4735075Z 2025-05-07T19:43:32.4735080Z 2025-05-07T19:43:32.4877923Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.4878389Z 2025-05-07T19:43:32.4880259Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.4880568Z 2025-05-07T19:43:32.4923858Z certifi-2025.4.26 | 154 KB | 
########## | 100%  2025-05-07T19:43:32.4924151Z 2025-05-07T19:43:32.4924156Z 2025-05-07T19:43:32.4924404Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.4924693Z 2025-05-07T19:43:32.4924697Z 2025-05-07T19:43:32.5816143Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.5816694Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.5825437Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.5826291Z 2025-05-07T19:43:32.5826498Z 2025-05-07T19:43:32.5826693Z  2025-05-07T19:43:32.5826921Z 2025-05-07T19:43:32.5826927Z 2025-05-07T19:43:32.5827097Z  2025-05-07T19:43:32.5827610Z 2025-05-07T19:43:32.5827614Z 2025-05-07T19:43:32.5827633Z 2025-05-07T19:43:32.5827834Z  done 2025-05-07T19:43:32.6833500Z Preparing transaction: - done 2025-05-07T19:43:32.7845356Z Verifying transaction: | done 2025-05-07T19:43:34.0872652Z Executing transaction: - \ | / - \ | / - \ | / - done 2025-05-07T19:43:35.6574669Z [SETUP] Updating Miniconda base packages ... 
2025-05-07T19:43:35.6609126Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:36.4199422Z Channels: 2025-05-07T19:43:36.4200104Z - defaults 2025-05-07T19:43:36.4200712Z Platform: linux-64 2025-05-07T19:43:37.4919672Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:37.6219310Z Solving environment: / - Channels: 2025-05-07T19:43:37.6219716Z - defaults 2025-05-07T19:43:37.6219991Z Platform: linux-64 2025-05-07T19:43:37.9033300Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:38.1217524Z Solving environment: / - \ done 2025-05-07T19:43:38.2420051Z | done 2025-05-07T19:43:38.3081205Z 2025-05-07T19:43:38.3081696Z ## Package Plan ## 2025-05-07T19:43:38.3082219Z 2025-05-07T19:43:38.3082604Z environment location: /github/home/miniconda 2025-05-07T19:43:38.3083321Z 2025-05-07T19:43:38.3083590Z added / updated specs: 2025-05-07T19:43:38.3084305Z - conda 2025-05-07T19:43:38.3084643Z 2025-05-07T19:43:38.3084654Z 2025-05-07T19:43:38.3085003Z The following packages will be downloaded: 2025-05-07T19:43:38.3085758Z 2025-05-07T19:43:38.3086008Z package | build 2025-05-07T19:43:38.3086326Z ---------------------------|----------------- 2025-05-07T19:43:38.3086683Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:38.3087059Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:38.3087433Z ------------------------------------------------------------ 2025-05-07T19:43:38.3087786Z Total: 1.4 MB 2025-05-07T19:43:38.3087986Z 2025-05-07T19:43:38.3088395Z The following packages will be UPDATED: 2025-05-07T19:43:38.3088611Z 2025-05-07T19:43:38.3088940Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:38.3089428Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:38.3089689Z 2025-05-07T19:43:38.3089692Z 2025-05-07T19:43:38.3089696Z 2025-05-07T19:43:38.3089834Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:38.3090209Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:38.3090431Z 2025-05-07T19:43:38.3608307Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:38.3714746Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.3715041Z 2025-05-07T19:43:38.6135727Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.6136162Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.6298667Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.6299391Z 2025-05-07T19:43:38.6300185Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.6300446Z 2025-05-07T19:43:38.6300773Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.6301114Z 2025-05-07T19:43:38.6301312Z 2025-05-07T19:43:38.6301477Z  done 2025-05-07T19:43:38.7311010Z Preparing transaction: - done 2025-05-07T19:43:38.8321825Z Verifying transaction: | done 2025-05-07T19:43:40.8356532Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:41.4070288Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:41.4073185Z + conda clean --packages --tarball -y 2025-05-07T19:43:41.4073871Z 2025-05-07T19:43:41.8548527Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:41.8549522Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:41.9102456Z 2025-05-07T19:43:41.9106879Z + conda clean --all -y 2025-05-07T19:43:41.9107398Z 2025-05-07T19:43:42.3580416Z There are no unused tarball(s) to remove. 2025-05-07T19:43:42.3581414Z Will remove 1 index cache(s). 2025-05-07T19:43:42.3582232Z There are no unused package(s) to remove. 2025-05-07T19:43:42.3583151Z There are no tempfile(s) to remove. 2025-05-07T19:43:42.3584000Z There are no logfile(s) to remove. 
2025-05-07T19:43:42.4137473Z 2025-05-07T19:43:42.4137865Z + conda info 2025-05-07T19:43:42.4138059Z 2025-05-07T19:43:42.9772089Z 2025-05-07T19:43:42.9772794Z active environment : base 2025-05-07T19:43:42.9773809Z active env location : /github/home/miniconda 2025-05-07T19:43:42.9774733Z shell level : 1 2025-05-07T19:43:42.9775540Z user config file : /github/home/.condarc 2025-05-07T19:43:42.9776890Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:42.9777968Z conda version : 25.3.1 2025-05-07T19:43:42.9778784Z conda-build version : not installed 2025-05-07T19:43:42.9779660Z python version : 3.13.2.final.0 2025-05-07T19:43:42.9780539Z solver : libmamba (default) 2025-05-07T19:43:42.9781498Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:42.9781845Z __conda=25.3.1=0 2025-05-07T19:43:42.9782140Z __glibc=2.34=0 2025-05-07T19:43:42.9782477Z __linux=6.1.130=0 2025-05-07T19:43:42.9782777Z __unix=0=0 2025-05-07T19:43:42.9783274Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:42.9783676Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:42.9784045Z conda av metadata url : None 2025-05-07T19:43:42.9784448Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:42.9784881Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:42.9785587Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:42.9785977Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:42.9786377Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:42.9786722Z /github/home/.conda/pkgs 2025-05-07T19:43:42.9787096Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:42.9787436Z /github/home/.conda/envs 2025-05-07T19:43:42.9787786Z platform : linux-64 2025-05-07T19:43:42.9788662Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 
aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:42.9789490Z UID:GID : 0:0 2025-05-07T19:43:42.9789795Z netrc file : None 2025-05-07T19:43:42.9790068Z offline mode : False 2025-05-07T19:43:42.9790281Z 2025-05-07T19:43:43.0373767Z 2025-05-07T19:43:43.0374575Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:43.0375278Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_bad157db-5625-47d4-92b6-1b94d23712e2 ... 2025-05-07T19:43:43.0375977Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:43.0532079Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:43.0532593Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:43.0533288Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:43.0533603Z env: 2025-05-07T19:43:43.0533829Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:43.0534109Z BUILD_ENV: build_binary 2025-05-07T19:43:43.0534350Z BUILD_TARGET: default 2025-05-07T19:43:43.0534564Z BUILD_VARIANT: cuda 2025-05-07T19:43:43.0534978Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:43.0535207Z ##[endgroup] 2025-05-07T19:43:43.4848886Z ################################################################################ 2025-05-07T19:43:43.4849317Z # Create Conda Environment 2025-05-07T19:43:43.4849613Z # 2025-05-07T19:43:43.4867350Z # [2025-05-07T19:43:43.486Z] + create_conda_environment build_binary 3.13 2025-05-07T19:43:43.4867922Z ################################################################################ 2025-05-07T19:43:43.4868158Z 2025-05-07T19:43:43.4886380Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:43.5718870Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:43.5719295Z [SETUP] Listing existing Conda environments ... 
2025-05-07T19:43:43.5719667Z + conda info --envs 2025-05-07T19:43:43.5719853Z 2025-05-07T19:43:44.1573051Z 2025-05-07T19:43:44.1573366Z # conda environments: 2025-05-07T19:43:44.1573662Z # 2025-05-07T19:43:44.1573946Z base /github/home/miniconda 2025-05-07T19:43:44.1574216Z 2025-05-07T19:43:44.2163369Z 2025-05-07T19:43:44.2164171Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:45.8840249Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:45.8858145Z 2025-05-07T19:43:45.8858153Z 2025-05-07T19:43:45.8875062Z [SETUP] Creating new Conda environment (Python 3.13) ... 2025-05-07T19:43:45.8901740Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.13 2025-05-07T19:43:46.4598778Z Channels: 2025-05-07T19:43:46.4599114Z - defaults 2025-05-07T19:43:46.4599401Z Platform: linux-64 2025-05-07T19:43:47.8556521Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:47.9562463Z Solving environment: | done 2025-05-07T19:43:47.9854113Z 2025-05-07T19:43:47.9854461Z ## Package Plan ## 2025-05-07T19:43:47.9854658Z 2025-05-07T19:43:47.9855342Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:47.9855771Z 2025-05-07T19:43:47.9855892Z added / updated specs: 2025-05-07T19:43:47.9856182Z - python=3.13 2025-05-07T19:43:47.9856364Z 2025-05-07T19:43:47.9856368Z 2025-05-07T19:43:47.9856599Z The following packages will be downloaded: 2025-05-07T19:43:47.9856841Z 2025-05-07T19:43:47.9857006Z package | build 2025-05-07T19:43:47.9857365Z ---------------------------|----------------- 2025-05-07T19:43:47.9857863Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:47.9858344Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:47.9858800Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:47.9859287Z python_abi-3.13 | 0_cp313 6 KB 2025-05-07T19:43:47.9859700Z ------------------------------------------------------------ 2025-05-07T19:43:47.9860099Z Total: 159 KB 
2025-05-07T19:43:47.9860334Z 2025-05-07T19:43:47.9860477Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:47.9860746Z 2025-05-07T19:43:47.9860979Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:47.9861476Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:47.9861925Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:47.9862809Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:47.9863344Z expat pkgs/main/linux-64::expat-2.7.1-h6a678d5_0 2025-05-07T19:43:47.9863870Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:47.9864405Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:47.9864863Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.9865361Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:47.9865969Z libmpdec pkgs/main/linux-64::libmpdec-4.0.0-h5eee18b_0 2025-05-07T19:43:47.9866500Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.9867012Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:47.9867467Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:47.9867947Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:47.9868396Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:47.9868875Z python pkgs/main/linux-64::python-3.13.2-hf623796_100_cp313 2025-05-07T19:43:47.9869354Z python_abi pkgs/main/linux-64::python_abi-3.13-0_cp313 2025-05-07T19:43:47.9869840Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:47.9870547Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py313h06a4308_0 2025-05-07T19:43:47.9871083Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:47.9871528Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:47.9871940Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 
2025-05-07T19:43:47.9872416Z wheel pkgs/main/linux-64::wheel-0.45.1-py313h06a4308_0 2025-05-07T19:43:47.9872861Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:47.9873259Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:47.9873553Z 2025-05-07T19:43:47.9873557Z 2025-05-07T19:43:47.9873561Z 2025-05-07T19:43:47.9873719Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:47.9874127Z ca-certificates-2025 | 129 KB | | 0% 2025-05-07T19:43:47.9874408Z 2025-05-07T19:43:47.9874773Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:47.9875032Z 2025-05-07T19:43:47.9875036Z 2025-05-07T19:43:47.9875288Z python_abi-3.13 | 6 KB | | 0%  2025-05-07T19:43:47.9875554Z 2025-05-07T19:43:47.9875558Z 2025-05-07T19:43:47.9881094Z 2025-05-07T19:43:48.0215745Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:48.0216853Z 2025-05-07T19:43:48.0258485Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:48.0343727Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:48.0344511Z 2025-05-07T19:43:48.0346265Z 2025-05-07T19:43:48.0373526Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:48.0373810Z 2025-05-07T19:43:48.0373836Z 2025-05-07T19:43:48.0373840Z 2025-05-07T19:43:48.0377842Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:48.0459488Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:48.0460253Z 2025-05-07T19:43:48.0467342Z 2025-05-07T19:43:48.0467725Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:48.0468004Z 2025-05-07T19:43:48.0468010Z 2025-05-07T19:43:48.0507245Z 2025-05-07T19:43:48.0508302Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:48.0509113Z 2025-05-07T19:43:48.0510468Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:48.0511518Z 2025-05-07T19:43:48.0512159Z 2025-05-07T19:43:48.0512634Z  2025-05-07T19:43:48.0513236Z 2025-05-07T19:43:48.0513247Z 2025-05-07T19:43:48.0515491Z  2025-05-07T19:43:48.0516130Z 
2025-05-07T19:43:48.0516141Z 2025-05-07T19:43:48.0516151Z 2025-05-07T19:43:48.0516686Z  done 2025-05-07T19:43:48.2623724Z Preparing transaction: - \ done 2025-05-07T19:43:49.8164204Z Verifying transaction: / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:52.0329505Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:52.0367522Z # 2025-05-07T19:43:52.0367972Z # To activate this environment, use 2025-05-07T19:43:52.0368309Z # 2025-05-07T19:43:52.0368558Z # $ conda activate build_binary 2025-05-07T19:43:52.0368959Z # 2025-05-07T19:43:52.0369225Z # To deactivate an active environment, use 2025-05-07T19:43:52.0369532Z # 2025-05-07T19:43:52.0369776Z # $ conda deactivate 2025-05-07T19:43:52.0370300Z 2025-05-07T19:43:52.1230357Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:52.1259043Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:55.0884817Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:43:55.0888550Z 2025-05-07T19:43:55.0889570Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (25.1) 2025-05-07T19:43:55.0891129Z Collecting pip 2025-05-07T19:43:55.0891820Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:55.0892852Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:55.0894337Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 75.3 MB/s eta 0:00:00 2025-05-07T19:43:55.0895174Z Installing collected packages: pip 2025-05-07T19:43:55.0895823Z Attempting uninstall: pip 2025-05-07T19:43:55.0896503Z Found existing installation: pip 25.1 2025-05-07T19:43:55.0897355Z Uninstalling pip-25.1: 2025-05-07T19:43:55.0898029Z Successfully uninstalled pip-25.1 2025-05-07T19:43:55.0898733Z Successfully installed pip-25.1.1 2025-05-07T19:43:55.0899181Z 2025-05-07T19:43:55.1663276Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:55.1692695Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:55.8368829Z Channels: 2025-05-07T19:43:55.8370683Z - conda-forge 2025-05-07T19:43:55.8371431Z Platform: linux-64 2025-05-07T19:44:05.5117674Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:07.4059544Z Solving environment: / - \ | / done 2025-05-07T19:44:07.4554398Z 2025-05-07T19:44:07.4554924Z ## Package Plan ## 2025-05-07T19:44:07.4555453Z 2025-05-07T19:44:07.4556076Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:07.4557020Z 2025-05-07T19:44:07.4557297Z added / updated specs: 2025-05-07T19:44:07.4558069Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:07.4558619Z 2025-05-07T19:44:07.4558632Z 2025-05-07T19:44:07.4558988Z The following packages will be downloaded: 2025-05-07T19:44:07.4559615Z 2025-05-07T19:44:07.4559944Z package | build 2025-05-07T19:44:07.4560910Z 
---------------------------|----------------- 2025-05-07T19:44:07.4561969Z cffi-1.17.1 | py313hfab6e84_0 289 KB conda-forge 2025-05-07T19:44:07.4563304Z cryptography-44.0.3 | py313h6556f6e_0 1.5 MB conda-forge 2025-05-07T19:44:07.4564589Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:07.4565784Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:07.4566517Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:07.4566951Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:07.4567512Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:07.4567934Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:07.4568397Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:07.4569217Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:07.4569647Z ------------------------------------------------------------ 2025-05-07T19:44:07.4570014Z Total: 6.4 MB 2025-05-07T19:44:07.4570459Z 2025-05-07T19:44:07.4570591Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:07.4571019Z 2025-05-07T19:44:07.4571246Z cffi conda-forge/linux-64::cffi-1.17.1-py313hfab6e84_0 2025-05-07T19:44:07.4571776Z cryptography conda-forge/linux-64::cryptography-44.0.3-py313h6556f6e_0 2025-05-07T19:44:07.4572284Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:07.4575413Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:07.4576041Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:07.4576713Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:07.4577331Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:07.4577706Z 2025-05-07T19:44:07.4577829Z The following packages will be UPDATED: 2025-05-07T19:44:07.4578042Z 2025-05-07T19:44:07.4578470Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> 
conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:07.4579272Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:07.4579956Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:07.4580625Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:07.4581005Z 2025-05-07T19:44:07.4581010Z 2025-05-07T19:44:07.4581014Z 2025-05-07T19:44:07.4581161Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:07.4581562Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:07.4581804Z 2025-05-07T19:44:07.4582165Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:07.4582424Z 2025-05-07T19:44:07.4582428Z 2025-05-07T19:44:07.4582640Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:07.4582909Z 2025-05-07T19:44:07.4582913Z 2025-05-07T19:44:07.4582917Z 2025-05-07T19:44:07.4583585Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:07.4583843Z 2025-05-07T19:44:07.4583847Z 2025-05-07T19:44:07.4583851Z 2025-05-07T19:44:07.4584408Z 2025-05-07T19:44:07.4593863Z cffi-1.17.1 | 289 KB | | 0%  2025-05-07T19:44:07.4594657Z 2025-05-07T19:44:07.4594672Z 2025-05-07T19:44:07.4594683Z 2025-05-07T19:44:07.4594693Z 2025-05-07T19:44:07.4594735Z 2025-05-07T19:44:07.4595500Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:07.4596323Z 2025-05-07T19:44:07.4596333Z 2025-05-07T19:44:07.4596378Z 2025-05-07T19:44:07.4596388Z 2025-05-07T19:44:07.4596398Z 2025-05-07T19:44:07.4596408Z 2025-05-07T19:44:07.4597110Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:07.4597942Z 2025-05-07T19:44:07.4597955Z 2025-05-07T19:44:07.4597966Z 2025-05-07T19:44:07.4597976Z 2025-05-07T19:44:07.4597985Z 2025-05-07T19:44:07.4597995Z 2025-05-07T19:44:07.4598005Z 2025-05-07T19:44:07.4599228Z typing-extensions-4. 
| 88 KB | | 0%  2025-05-07T19:44:07.4600153Z 2025-05-07T19:44:07.4600164Z 2025-05-07T19:44:07.4600174Z 2025-05-07T19:44:07.4600184Z 2025-05-07T19:44:07.4600194Z 2025-05-07T19:44:07.4600205Z 2025-05-07T19:44:07.4600216Z 2025-05-07T19:44:07.4600226Z 2025-05-07T19:44:07.4601059Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:07.4601390Z 2025-05-07T19:44:07.4601393Z 2025-05-07T19:44:07.4601396Z 2025-05-07T19:44:07.4601400Z 2025-05-07T19:44:07.4603053Z 2025-05-07T19:44:07.4603057Z 2025-05-07T19:44:07.4603061Z 2025-05-07T19:44:07.4603064Z 2025-05-07T19:44:07.4603067Z 2025-05-07T19:44:07.5114869Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:07.5115234Z 2025-05-07T19:44:07.5115238Z 2025-05-07T19:44:07.5115242Z 2025-05-07T19:44:07.5115246Z 2025-05-07T19:44:07.5228918Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:07.5229222Z 2025-05-07T19:44:07.5229272Z 2025-05-07T19:44:07.5357367Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.5358189Z 2025-05-07T19:44:07.5358203Z 2025-05-07T19:44:07.5358214Z 2025-05-07T19:44:07.5520993Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.5521811Z 2025-05-07T19:44:07.5521842Z 2025-05-07T19:44:07.5521853Z 2025-05-07T19:44:07.5521864Z 2025-05-07T19:44:07.5558456Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:07.5559700Z openssl-3.5.0 | 3.0 MB | ###3 | 33% 2025-05-07T19:44:07.5560460Z 2025-05-07T19:44:07.5594611Z cryptography-44.0.3 | 1.5 MB | 8 | 8%  2025-05-07T19:44:07.5595466Z 2025-05-07T19:44:07.5595479Z 2025-05-07T19:44:07.5595491Z 2025-05-07T19:44:07.5595502Z 2025-05-07T19:44:07.5595512Z 2025-05-07T19:44:07.5652786Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.5653718Z 2025-05-07T19:44:07.5653732Z 2025-05-07T19:44:07.5653777Z 2025-05-07T19:44:07.5653788Z 2025-05-07T19:44:07.5653799Z 2025-05-07T19:44:07.5653809Z 2025-05-07T19:44:07.5679209Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:07.5680129Z 2025-05-07T19:44:07.5680172Z 
2025-05-07T19:44:07.5680183Z 2025-05-07T19:44:07.5680195Z 2025-05-07T19:44:07.5680205Z 2025-05-07T19:44:07.5680216Z 2025-05-07T19:44:07.5820267Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.5821185Z 2025-05-07T19:44:07.5821270Z 2025-05-07T19:44:07.5821282Z 2025-05-07T19:44:07.5821293Z 2025-05-07T19:44:07.5821303Z 2025-05-07T19:44:07.5821313Z 2025-05-07T19:44:07.5821323Z 2025-05-07T19:44:07.5842056Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:07.5842448Z 2025-05-07T19:44:07.5842453Z 2025-05-07T19:44:07.5842457Z 2025-05-07T19:44:07.5842460Z 2025-05-07T19:44:07.5842464Z 2025-05-07T19:44:07.5842467Z 2025-05-07T19:44:07.5842471Z 2025-05-07T19:44:07.5878084Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.5879078Z 2025-05-07T19:44:07.5879124Z 2025-05-07T19:44:07.5879135Z 2025-05-07T19:44:07.5879145Z 2025-05-07T19:44:07.5879156Z 2025-05-07T19:44:07.5879907Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.5880732Z 2025-05-07T19:44:07.5880778Z 2025-05-07T19:44:07.5880788Z 2025-05-07T19:44:07.5880799Z 2025-05-07T19:44:07.5880809Z 2025-05-07T19:44:07.5923155Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.5923498Z 2025-05-07T19:44:07.5923503Z 2025-05-07T19:44:07.5923507Z 2025-05-07T19:44:07.5923764Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.5924036Z 2025-05-07T19:44:07.5924040Z 2025-05-07T19:44:07.5924044Z 2025-05-07T19:44:07.5960614Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.5961505Z 2025-05-07T19:44:07.5961519Z 2025-05-07T19:44:07.5962609Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.5963360Z 2025-05-07T19:44:07.5963410Z 2025-05-07T19:44:07.6016825Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.6017319Z 2025-05-07T19:44:07.6017325Z 2025-05-07T19:44:07.6017328Z 2025-05-07T19:44:07.6017332Z 2025-05-07T19:44:07.6017336Z 2025-05-07T19:44:07.6017339Z 2025-05-07T19:44:07.6017343Z 
2025-05-07T19:44:07.6017375Z 2025-05-07T19:44:07.6037402Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:07.6038493Z 2025-05-07T19:44:07.6038498Z 2025-05-07T19:44:07.6038501Z 2025-05-07T19:44:07.6038505Z 2025-05-07T19:44:07.6038508Z 2025-05-07T19:44:07.6038512Z 2025-05-07T19:44:07.6038543Z 2025-05-07T19:44:07.6038547Z 2025-05-07T19:44:07.6038856Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.6039170Z 2025-05-07T19:44:07.6039174Z 2025-05-07T19:44:07.6039178Z 2025-05-07T19:44:07.6039191Z 2025-05-07T19:44:07.6039194Z 2025-05-07T19:44:07.6039198Z 2025-05-07T19:44:07.6039201Z 2025-05-07T19:44:07.6039204Z 2025-05-07T19:44:07.6039236Z 2025-05-07T19:44:07.6042218Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:07.6042515Z 2025-05-07T19:44:07.6042530Z 2025-05-07T19:44:07.6042534Z 2025-05-07T19:44:07.6042538Z 2025-05-07T19:44:07.6042541Z 2025-05-07T19:44:07.6042545Z 2025-05-07T19:44:07.6042959Z 2025-05-07T19:44:07.6054033Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.6054951Z 2025-05-07T19:44:07.6054983Z 2025-05-07T19:44:07.6054994Z 2025-05-07T19:44:07.6055005Z 2025-05-07T19:44:07.6055015Z 2025-05-07T19:44:07.6055025Z 2025-05-07T19:44:07.6055036Z 2025-05-07T19:44:07.6055078Z 2025-05-07T19:44:07.6055088Z 2025-05-07T19:44:07.6206164Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.6207073Z 2025-05-07T19:44:07.6207087Z 2025-05-07T19:44:07.6207132Z 2025-05-07T19:44:07.6207143Z 2025-05-07T19:44:07.6207154Z 2025-05-07T19:44:07.6207165Z 2025-05-07T19:44:07.6207201Z 2025-05-07T19:44:07.6207211Z 2025-05-07T19:44:07.6360867Z typing_extensions-4. 
| 51 KB | ########## | 100%  2025-05-07T19:44:07.6361852Z 2025-05-07T19:44:07.6361866Z 2025-05-07T19:44:07.6361877Z 2025-05-07T19:44:07.6361887Z 2025-05-07T19:44:07.6361898Z 2025-05-07T19:44:07.6361937Z 2025-05-07T19:44:07.6361948Z 2025-05-07T19:44:07.6361958Z 2025-05-07T19:44:07.6362000Z 2025-05-07T19:44:07.6403178Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.6546233Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.6546989Z 2025-05-07T19:44:07.6546994Z 2025-05-07T19:44:07.6546998Z 2025-05-07T19:44:07.6547001Z 2025-05-07T19:44:07.6547005Z 2025-05-07T19:44:07.6547009Z 2025-05-07T19:44:07.6565390Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.6565912Z 2025-05-07T19:44:07.6712087Z cryptography-44.0.3 | 1.5 MB | ########5 | 86%  2025-05-07T19:44:07.6712958Z 2025-05-07T19:44:07.7404255Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.7405474Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.7787546Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.7788329Z 2025-05-07T19:44:07.7790882Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.7792013Z 2025-05-07T19:44:07.7792647Z 2025-05-07T19:44:07.7793133Z  2025-05-07T19:44:07.7793743Z 2025-05-07T19:44:07.7793755Z 2025-05-07T19:44:07.7794262Z  2025-05-07T19:44:07.7794880Z 2025-05-07T19:44:07.7794892Z 2025-05-07T19:44:07.7794902Z 2025-05-07T19:44:07.7795891Z  2025-05-07T19:44:07.7796141Z 2025-05-07T19:44:07.7796145Z 2025-05-07T19:44:07.7796148Z 2025-05-07T19:44:07.7796152Z 2025-05-07T19:44:07.7796346Z  2025-05-07T19:44:07.7796573Z 2025-05-07T19:44:07.7796576Z 2025-05-07T19:44:07.7796604Z 2025-05-07T19:44:07.7796608Z 2025-05-07T19:44:07.7796611Z 2025-05-07T19:44:07.7796795Z  2025-05-07T19:44:07.7797017Z 2025-05-07T19:44:07.7797107Z 2025-05-07T19:44:07.7797111Z 2025-05-07T19:44:07.7797115Z 2025-05-07T19:44:07.7797118Z 2025-05-07T19:44:07.7797121Z 2025-05-07T19:44:07.7797335Z  2025-05-07T19:44:07.7797561Z 
2025-05-07T19:44:07.7797565Z 2025-05-07T19:44:07.7797568Z 2025-05-07T19:44:07.7797572Z 2025-05-07T19:44:07.7797575Z 2025-05-07T19:44:07.7797579Z 2025-05-07T19:44:07.7797582Z 2025-05-07T19:44:07.7797809Z  2025-05-07T19:44:07.7798038Z 2025-05-07T19:44:07.7798042Z 2025-05-07T19:44:07.7798045Z 2025-05-07T19:44:07.7798048Z 2025-05-07T19:44:07.7798052Z 2025-05-07T19:44:07.7798056Z 2025-05-07T19:44:07.7798059Z 2025-05-07T19:44:07.7798062Z 2025-05-07T19:44:07.7798256Z  2025-05-07T19:44:07.7798514Z 2025-05-07T19:44:07.7798518Z 2025-05-07T19:44:07.7798521Z 2025-05-07T19:44:07.7798529Z 2025-05-07T19:44:07.7798533Z 2025-05-07T19:44:07.7798536Z 2025-05-07T19:44:07.7798539Z 2025-05-07T19:44:07.7798543Z 2025-05-07T19:44:07.7798546Z 2025-05-07T19:44:07.7798758Z  done 2025-05-07T19:44:07.8804282Z Preparing transaction: \ done 2025-05-07T19:44:07.9811293Z Verifying transaction: / done 2025-05-07T19:44:09.3839776Z Executing transaction: \ | / - \ | / - \ | / - \ | done 2025-05-07T19:44:09.4811901Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:11.1705743Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:11.1725738Z [SETUP] Installing libxcrypt ... 
2025-05-07T19:44:11.1754193Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:11.8416687Z Channels: 2025-05-07T19:44:11.8418043Z - conda-forge 2025-05-07T19:44:11.8418383Z Platform: linux-64 2025-05-07T19:44:14.9638741Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:15.3903375Z Solving environment: \ done 2025-05-07T19:44:15.4400025Z 2025-05-07T19:44:15.4400442Z ## Package Plan ## 2025-05-07T19:44:15.4400912Z 2025-05-07T19:44:15.4401497Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:15.4402465Z 2025-05-07T19:44:15.4402747Z added / updated specs: 2025-05-07T19:44:15.4403474Z - libxcrypt 2025-05-07T19:44:15.4403847Z 2025-05-07T19:44:15.4403859Z 2025-05-07T19:44:15.4404259Z The following packages will be downloaded: 2025-05-07T19:44:15.4404929Z 2025-05-07T19:44:15.4405264Z package | build 2025-05-07T19:44:15.4406101Z ---------------------------|----------------- 2025-05-07T19:44:15.4406521Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:15.4407086Z ------------------------------------------------------------ 2025-05-07T19:44:15.4407442Z Total: 98 KB 2025-05-07T19:44:15.4407660Z 2025-05-07T19:44:15.4407814Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:15.4408037Z 2025-05-07T19:44:15.4408278Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:15.4408591Z 2025-05-07T19:44:15.4408595Z 2025-05-07T19:44:15.4408599Z 2025-05-07T19:44:15.4408743Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:15.5851654Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:15.5869628Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:15.5964329Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.5964975Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.5965337Z 2025-05-07T19:44:15.5965667Z done 2025-05-07T19:44:15.6974345Z Preparing transaction: / done 2025-05-07T19:44:15.7981381Z Verifying transaction: \ done 2025-05-07T19:44:15.8995805Z Executing transaction: / done 2025-05-07T19:44:19.2258940Z [SETUP] Copying over ... 2025-05-07T19:44:19.2259716Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.13/crypt.h 2025-05-07T19:44:19.2260345Z 2025-05-07T19:44:19.2304843Z 2025-05-07T19:44:20.8438327Z [SETUP] Installed Python version: Python 3.13.2 2025-05-07T19:44:20.8439671Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:20.8514466Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.8514962Z . 
$PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.8515595Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:20.8515958Z env: 2025-05-07T19:44:20.8516194Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:20.8516531Z BUILD_ENV: build_binary 2025-05-07T19:44:20.8516789Z BUILD_TARGET: default 2025-05-07T19:44:20.8517070Z BUILD_VARIANT: cuda 2025-05-07T19:44:20.8517353Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:20.8517651Z ##[endgroup] 2025-05-07T19:44:21.2812400Z ################################################################################ 2025-05-07T19:44:21.2813447Z # Install C/C++ Compilers 2025-05-07T19:44:21.2814162Z # 2025-05-07T19:44:21.2827035Z # [2025-05-07T19:44:21.282Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:21.2828436Z ################################################################################ 2025-05-07T19:44:21.2829124Z 2025-05-07T19:44:21.2845282Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:21.3697159Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:21.3706015Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:21.3738580Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:22.0474649Z Channels: 2025-05-07T19:44:22.0474962Z - conda-forge 2025-05-07T19:44:22.0475267Z Platform: linux-64 2025-05-07T19:44:25.1261328Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:25.5498548Z Solving environment: \ done 2025-05-07T19:44:25.5981192Z 2025-05-07T19:44:25.5981521Z ## Package Plan ## 2025-05-07T19:44:25.5981708Z 2025-05-07T19:44:25.5981914Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:25.5982225Z 2025-05-07T19:44:25.5982343Z added / updated specs: 2025-05-07T19:44:25.5982627Z - sysroot_linux-64=2.17 2025-05-07T19:44:25.5982815Z 2025-05-07T19:44:25.5982819Z 2025-05-07T19:44:25.5982944Z The following packages will be downloaded: 2025-05-07T19:44:25.5983181Z 2025-05-07T19:44:25.5983317Z package | build 2025-05-07T19:44:25.5983648Z ---------------------------|----------------- 2025-05-07T19:44:25.5984097Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:25.5984609Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:25.5985057Z ------------------------------------------------------------ 2025-05-07T19:44:25.5985406Z Total: 15.4 MB 2025-05-07T19:44:25.5985635Z 2025-05-07T19:44:25.5985774Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:25.5985999Z 2025-05-07T19:44:25.5986310Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:25.5986898Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:25.5987229Z 2025-05-07T19:44:25.5987234Z 2025-05-07T19:44:25.5987238Z 2025-05-07T19:44:25.5987380Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:25.5987773Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.5988009Z 2025-05-07T19:44:25.8050253Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:25.8050662Z 2025-05-07T19:44:25.8229592Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:25.8288372Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.8289199Z 2025-05-07T19:44:25.9245998Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.9969996Z sysroot_linux-64-2.1 | 14.5 MB | ######3 | 64% 2025-05-07T19:44:25.9971194Z 2025-05-07T19:44:25.9972035Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.9972788Z 2025-05-07T19:44:26.0564189Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:26.4952487Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:26.4953689Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:26.4954646Z 2025-05-07T19:44:26.4955047Z 2025-05-07T19:44:26.4955314Z  done 2025-05-07T19:44:26.5964244Z Preparing transaction: / done 2025-05-07T19:44:26.7974739Z Verifying transaction: \ | done 2025-05-07T19:44:26.8981554Z Executing transaction: - done 2025-05-07T19:44:26.9832595Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:26.9833485Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:28.6420199Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:28.6440028Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 
2025-05-07T19:44:28.6467838Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:29.3469781Z Channels: 2025-05-07T19:44:29.3470828Z - conda-forge 2025-05-07T19:44:29.3471496Z Platform: linux-64 2025-05-07T19:44:32.3761694Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:33.5019629Z Solving environment: \ | / done 2025-05-07T19:44:33.5523022Z 2025-05-07T19:44:33.5523607Z ## Package Plan ## 2025-05-07T19:44:33.5524071Z 2025-05-07T19:44:33.5524711Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:33.5525533Z 2025-05-07T19:44:33.5525636Z added / updated specs: 2025-05-07T19:44:33.5526028Z - gxx_linux-64=11.4.0 2025-05-07T19:44:33.5526187Z 2025-05-07T19:44:33.5526191Z 2025-05-07T19:44:33.5526318Z The following packages will be downloaded: 2025-05-07T19:44:33.5526561Z 2025-05-07T19:44:33.5526679Z package | build 2025-05-07T19:44:33.5527020Z ---------------------------|----------------- 2025-05-07T19:44:33.5527444Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:33.5527946Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:33.5528410Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:33.5528874Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:33.5529317Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:33.5529782Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:33.5530211Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:33.5530695Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:33.5531190Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:33.5531639Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:33.5532136Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:33.5532621Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:33.5533063Z ------------------------------------------------------------ 2025-05-07T19:44:33.5533429Z Total: 91.6 MB 2025-05-07T19:44:33.5533646Z 2025-05-07T19:44:33.5534074Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:33.5534303Z 2025-05-07T19:44:33.5534616Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:33.5535190Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:33.5535753Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:33.5536774Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:33.5537385Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:33.5537921Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:33.5538471Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.5539064Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:33.5539594Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:33.5540162Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.5540557Z 2025-05-07T19:44:33.5540676Z The following packages will be UPDATED: 2025-05-07T19:44:33.5540890Z 2025-05-07T19:44:33.5541214Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:33.5541984Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:33.5542409Z 2025-05-07T19:44:33.5542414Z 2025-05-07T19:44:33.5542436Z 2025-05-07T19:44:33.5542585Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:33.5542971Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.5543226Z 2025-05-07T19:44:33.5543547Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.5543798Z 2025-05-07T19:44:33.5543803Z 2025-05-07T19:44:33.5544107Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.5544392Z 2025-05-07T19:44:33.5544395Z 2025-05-07T19:44:33.5544399Z 2025-05-07T19:44:33.5544640Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.5544945Z 2025-05-07T19:44:33.5544949Z 2025-05-07T19:44:33.5544953Z 2025-05-07T19:44:33.5544956Z 2025-05-07T19:44:33.5564409Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:33.5564776Z 2025-05-07T19:44:33.5564800Z 2025-05-07T19:44:33.5564804Z 2025-05-07T19:44:33.5564808Z 2025-05-07T19:44:33.5564811Z 2025-05-07T19:44:33.5585606Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.5585987Z 2025-05-07T19:44:33.5585992Z 2025-05-07T19:44:33.5586022Z 2025-05-07T19:44:33.5586026Z 2025-05-07T19:44:33.5586029Z 2025-05-07T19:44:33.5586033Z 2025-05-07T19:44:33.5586320Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:33.5586633Z 2025-05-07T19:44:33.5586655Z 2025-05-07T19:44:33.5586659Z 2025-05-07T19:44:33.5586662Z 2025-05-07T19:44:33.5586666Z 2025-05-07T19:44:33.5586670Z 2025-05-07T19:44:33.5586703Z 2025-05-07T19:44:33.5586981Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:33.5587282Z 2025-05-07T19:44:33.5587286Z 2025-05-07T19:44:33.5587290Z 2025-05-07T19:44:33.5587293Z 2025-05-07T19:44:33.5587297Z 2025-05-07T19:44:33.5587300Z 2025-05-07T19:44:33.5587304Z 2025-05-07T19:44:33.5587345Z 2025-05-07T19:44:33.5587847Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:33.5588154Z 2025-05-07T19:44:33.5588158Z 2025-05-07T19:44:33.5588161Z 2025-05-07T19:44:33.5588165Z 2025-05-07T19:44:33.5588168Z 2025-05-07T19:44:33.5588172Z 2025-05-07T19:44:33.5588175Z 2025-05-07T19:44:33.5588216Z 2025-05-07T19:44:33.5588220Z 2025-05-07T19:44:33.5588931Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:33.5589244Z 
2025-05-07T19:44:33.5589481Z 2025-05-07T19:44:33.5589485Z 2025-05-07T19:44:33.5589488Z 2025-05-07T19:44:33.5589492Z 2025-05-07T19:44:33.5589496Z 2025-05-07T19:44:33.5589527Z 2025-05-07T19:44:33.5589531Z 2025-05-07T19:44:33.5589534Z 2025-05-07T19:44:33.5591337Z 2025-05-07T19:44:33.5591653Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:33.5591972Z 2025-05-07T19:44:33.5591976Z 2025-05-07T19:44:33.5592010Z 2025-05-07T19:44:33.5592122Z 2025-05-07T19:44:33.5592127Z 2025-05-07T19:44:33.5592131Z 2025-05-07T19:44:33.5592135Z 2025-05-07T19:44:33.5592138Z 2025-05-07T19:44:33.5592142Z 2025-05-07T19:44:33.5592145Z 2025-05-07T19:44:33.5592148Z 2025-05-07T19:44:33.6830392Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:33.6831404Z 2025-05-07T19:44:33.6831418Z 2025-05-07T19:44:33.6831429Z 2025-05-07T19:44:33.6882654Z 2025-05-07T19:44:33.7906758Z libstdcxx-15.1.0 | 3.7 MB | 1 | 2%  2025-05-07T19:44:33.7907177Z 2025-05-07T19:44:33.7907182Z 2025-05-07T19:44:33.7907188Z 2025-05-07T19:44:33.7907191Z 2025-05-07T19:44:33.8548408Z libstdcxx-15.1.0 | 3.7 MB | 5 | 5%  2025-05-07T19:44:33.8549291Z 2025-05-07T19:44:33.8549308Z 2025-05-07T19:44:33.8919965Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.8920287Z 2025-05-07T19:44:33.8920649Z 2025-05-07T19:44:33.8920674Z 2025-05-07T19:44:33.8920745Z 2025-05-07T19:44:33.9064880Z libstdcxx-15.1.0 | 3.7 MB | #########4 | 95%  2025-05-07T19:44:33.9065773Z 2025-05-07T19:44:33.9065787Z 2025-05-07T19:44:33.9065798Z 2025-05-07T19:44:33.9065808Z 2025-05-07T19:44:33.9124512Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.9125076Z 2025-05-07T19:44:33.9201121Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.9320885Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.9321799Z 2025-05-07T19:44:33.9321813Z 2025-05-07T19:44:33.9321824Z 2025-05-07T19:44:33.9384291Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.9384609Z 2025-05-07T19:44:33.9384614Z 2025-05-07T19:44:33.9384619Z 
2025-05-07T19:44:33.9384623Z 2025-05-07T19:44:33.9384837Z 2025-05-07T19:44:33.9549865Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.9550783Z 2025-05-07T19:44:33.9550812Z 2025-05-07T19:44:34.0130089Z libstdcxx-devel_linu | 11.1 MB | ######1 | 61%  2025-05-07T19:44:34.0130449Z 2025-05-07T19:44:34.0202242Z gxx_impl_linux-64-11 | 11.2 MB | #####1 | 52%  2025-05-07T19:44:34.0320640Z gcc_impl_linux-64-11 | 53.0 MB | #3 | 14% 2025-05-07T19:44:34.0320998Z 2025-05-07T19:44:34.0321168Z 2025-05-07T19:44:34.0321179Z 2025-05-07T19:44:34.0327751Z binutils_impl_linux- | 6.0 MB | ######5 | 65%  2025-05-07T19:44:34.0328094Z 2025-05-07T19:44:34.0328099Z 2025-05-07T19:44:34.0328126Z 2025-05-07T19:44:34.0330056Z 2025-05-07T19:44:34.0330060Z 2025-05-07T19:44:34.0750579Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:34.0750954Z 2025-05-07T19:44:34.0750961Z 2025-05-07T19:44:34.0750968Z 2025-05-07T19:44:34.0750975Z 2025-05-07T19:44:34.0750981Z 2025-05-07T19:44:34.0750985Z 2025-05-07T19:44:34.0784946Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:34.0785275Z 2025-05-07T19:44:34.0785322Z 2025-05-07T19:44:34.0785326Z 2025-05-07T19:44:34.0785330Z 2025-05-07T19:44:34.1164038Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:34.1164351Z 2025-05-07T19:44:34.1164357Z 2025-05-07T19:44:34.1164362Z 2025-05-07T19:44:34.1201822Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:34.1289693Z gcc_impl_linux-64-11 | 53.0 MB | ##9 | 29% 2025-05-07T19:44:34.1290090Z 2025-05-07T19:44:34.1290256Z 2025-05-07T19:44:34.1290261Z 2025-05-07T19:44:34.1290622Z 2025-05-07T19:44:34.1290626Z 2025-05-07T19:44:34.1290630Z 2025-05-07T19:44:34.1580674Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:34.1581046Z 2025-05-07T19:44:34.1581055Z 2025-05-07T19:44:34.1581061Z 2025-05-07T19:44:34.1581066Z 2025-05-07T19:44:34.1581071Z 2025-05-07T19:44:34.1581077Z 2025-05-07T19:44:34.1581082Z 2025-05-07T19:44:34.1594383Z 
ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:34.1594765Z 2025-05-07T19:44:34.1594769Z 2025-05-07T19:44:34.1595034Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:34.1595313Z 2025-05-07T19:44:34.1595318Z 2025-05-07T19:44:34.1767727Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:34.1768593Z 2025-05-07T19:44:34.1768605Z 2025-05-07T19:44:34.1768616Z 2025-05-07T19:44:34.1768627Z 2025-05-07T19:44:34.1768638Z 2025-05-07T19:44:34.1768650Z 2025-05-07T19:44:34.1768660Z 2025-05-07T19:44:34.1818121Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:34.1818442Z 2025-05-07T19:44:34.1818447Z 2025-05-07T19:44:34.1818462Z 2025-05-07T19:44:34.1818466Z 2025-05-07T19:44:34.1818470Z 2025-05-07T19:44:34.1818473Z 2025-05-07T19:44:34.1818477Z 2025-05-07T19:44:34.1818480Z 2025-05-07T19:44:34.1833936Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:34.1834273Z 2025-05-07T19:44:34.1834294Z 2025-05-07T19:44:34.1834298Z 2025-05-07T19:44:34.1834301Z 2025-05-07T19:44:34.1834305Z 2025-05-07T19:44:34.1834308Z 2025-05-07T19:44:34.1834311Z 2025-05-07T19:44:34.1834315Z 2025-05-07T19:44:34.2156410Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:34.2157361Z 2025-05-07T19:44:34.2157375Z 2025-05-07T19:44:34.2157386Z 2025-05-07T19:44:34.2157398Z 2025-05-07T19:44:34.2157411Z 2025-05-07T19:44:34.2157421Z 2025-05-07T19:44:34.2157431Z 2025-05-07T19:44:34.2157442Z 2025-05-07T19:44:34.2157486Z 2025-05-07T19:44:34.2157497Z 2025-05-07T19:44:34.2157507Z 2025-05-07T19:44:34.2168405Z binutils_linux-64-2. 
| 28 KB | #####6 | 56%  2025-05-07T19:44:34.2168756Z 2025-05-07T19:44:34.2168761Z 2025-05-07T19:44:34.2168765Z 2025-05-07T19:44:34.2168768Z 2025-05-07T19:44:34.2168773Z 2025-05-07T19:44:34.2169386Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:34.2169691Z 2025-05-07T19:44:34.2169711Z 2025-05-07T19:44:34.2169715Z 2025-05-07T19:44:34.2169719Z 2025-05-07T19:44:34.2169722Z 2025-05-07T19:44:34.2169726Z 2025-05-07T19:44:34.2169729Z 2025-05-07T19:44:34.2169732Z 2025-05-07T19:44:34.2171546Z 2025-05-07T19:44:34.2174166Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:34.2174463Z 2025-05-07T19:44:34.2174472Z 2025-05-07T19:44:34.2174476Z 2025-05-07T19:44:34.2174479Z 2025-05-07T19:44:34.2174482Z 2025-05-07T19:44:34.2189264Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:34.2189634Z 2025-05-07T19:44:34.2189639Z 2025-05-07T19:44:34.2189643Z 2025-05-07T19:44:34.2189646Z 2025-05-07T19:44:34.2189649Z 2025-05-07T19:44:34.2189653Z 2025-05-07T19:44:34.2189656Z 2025-05-07T19:44:34.2189660Z 2025-05-07T19:44:34.2189663Z 2025-05-07T19:44:34.2189667Z 2025-05-07T19:44:34.2189670Z 2025-05-07T19:44:34.2190013Z binutils_linux-64-2. 
| 28 KB | ########## | 100%  2025-05-07T19:44:34.2190330Z 2025-05-07T19:44:34.2190334Z 2025-05-07T19:44:34.2190344Z 2025-05-07T19:44:34.2190347Z 2025-05-07T19:44:34.2190351Z 2025-05-07T19:44:34.2190354Z 2025-05-07T19:44:34.2190358Z 2025-05-07T19:44:34.2190361Z 2025-05-07T19:44:34.2190455Z 2025-05-07T19:44:34.2201333Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:34.2225349Z gcc_impl_linux-64-11 | 53.0 MB | ####9 | 49% 2025-05-07T19:44:34.2225808Z 2025-05-07T19:44:34.2225922Z 2025-05-07T19:44:34.2226139Z 2025-05-07T19:44:34.2226154Z 2025-05-07T19:44:34.2226158Z 2025-05-07T19:44:34.2226162Z 2025-05-07T19:44:34.2226166Z 2025-05-07T19:44:34.2226169Z 2025-05-07T19:44:34.2226173Z 2025-05-07T19:44:34.2226176Z 2025-05-07T19:44:34.2244097Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:34.2244722Z 2025-05-07T19:44:34.2245578Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:34.2246150Z 2025-05-07T19:44:34.2246460Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:34.2246721Z 2025-05-07T19:44:34.2246727Z 2025-05-07T19:44:34.2246730Z 2025-05-07T19:44:34.2246733Z 2025-05-07T19:44:34.2246737Z 2025-05-07T19:44:34.2246743Z 2025-05-07T19:44:34.2246746Z 2025-05-07T19:44:34.2246750Z 2025-05-07T19:44:34.2246753Z 2025-05-07T19:44:34.2246762Z 2025-05-07T19:44:34.2334711Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:34.2335084Z 2025-05-07T19:44:34.2335191Z 2025-05-07T19:44:34.2335204Z 2025-05-07T19:44:34.2335212Z 2025-05-07T19:44:34.2335217Z 2025-05-07T19:44:34.2335223Z 2025-05-07T19:44:34.2335853Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:34.2336321Z 2025-05-07T19:44:34.2336327Z 2025-05-07T19:44:34.2336332Z 2025-05-07T19:44:34.2336336Z 2025-05-07T19:44:34.2336340Z 2025-05-07T19:44:34.2337747Z 2025-05-07T19:44:34.2683492Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:34.2684464Z 2025-05-07T19:44:34.2684477Z 2025-05-07T19:44:34.2684514Z 
2025-05-07T19:44:34.2684524Z 2025-05-07T19:44:34.2684535Z 2025-05-07T19:44:34.2684545Z 2025-05-07T19:44:34.2684557Z 2025-05-07T19:44:34.2685324Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:34.2686162Z 2025-05-07T19:44:34.2686173Z 2025-05-07T19:44:34.2686183Z 2025-05-07T19:44:34.2686194Z 2025-05-07T19:44:34.2686206Z 2025-05-07T19:44:34.2686282Z 2025-05-07T19:44:34.2686292Z 2025-05-07T19:44:34.3055248Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:34.3055596Z 2025-05-07T19:44:34.3055603Z 2025-05-07T19:44:34.3055609Z 2025-05-07T19:44:34.3055613Z 2025-05-07T19:44:34.3055619Z 2025-05-07T19:44:34.3055647Z 2025-05-07T19:44:34.3055651Z 2025-05-07T19:44:34.3055654Z 2025-05-07T19:44:34.3055932Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:34.3056338Z 2025-05-07T19:44:34.3056342Z 2025-05-07T19:44:34.3056345Z 2025-05-07T19:44:34.3056350Z 2025-05-07T19:44:34.3056355Z 2025-05-07T19:44:34.3056359Z 2025-05-07T19:44:34.3056365Z 2025-05-07T19:44:34.3056390Z 2025-05-07T19:44:34.3205987Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:34.3335640Z gcc_impl_linux-64-11 | 53.0 MB | ######6 | 67% 2025-05-07T19:44:34.3336138Z 2025-05-07T19:44:34.3336481Z 2025-05-07T19:44:34.3336488Z 2025-05-07T19:44:34.3336528Z 2025-05-07T19:44:34.3336566Z 2025-05-07T19:44:34.3336575Z 2025-05-07T19:44:34.3336579Z 2025-05-07T19:44:34.3336582Z 2025-05-07T19:44:34.3336618Z 2025-05-07T19:44:34.3336623Z 2025-05-07T19:44:34.3336653Z 2025-05-07T19:44:34.3337655Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:34.3338021Z 2025-05-07T19:44:34.3338028Z 2025-05-07T19:44:34.3338038Z 2025-05-07T19:44:34.3338059Z 2025-05-07T19:44:34.3338098Z 2025-05-07T19:44:34.3338102Z 2025-05-07T19:44:34.3338105Z 2025-05-07T19:44:34.3338109Z 2025-05-07T19:44:34.3338114Z 2025-05-07T19:44:34.3338118Z 2025-05-07T19:44:34.3338121Z 2025-05-07T19:44:34.3678131Z binutils_linux-64-2. 
| 28 KB | ########## | 100%  2025-05-07T19:44:34.3678724Z 2025-05-07T19:44:34.3678748Z 2025-05-07T19:44:34.3678754Z 2025-05-07T19:44:34.3678759Z 2025-05-07T19:44:34.3678764Z 2025-05-07T19:44:34.3678769Z 2025-05-07T19:44:34.3678776Z 2025-05-07T19:44:34.3679048Z 2025-05-07T19:44:34.3679052Z 2025-05-07T19:44:34.3679592Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:34.3679907Z 2025-05-07T19:44:34.3679911Z 2025-05-07T19:44:34.3679915Z 2025-05-07T19:44:34.3679918Z 2025-05-07T19:44:34.3679922Z 2025-05-07T19:44:34.3679925Z 2025-05-07T19:44:34.3679928Z 2025-05-07T19:44:34.3679932Z 2025-05-07T19:44:34.3679935Z 2025-05-07T19:44:34.4207157Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:34.4884980Z gcc_impl_linux-64-11 | 53.0 MB | ########6 | 87% 2025-05-07T19:44:34.4885728Z 2025-05-07T19:44:34.4885751Z 2025-05-07T19:44:34.4885767Z 2025-05-07T19:44:34.5060412Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:34.5061325Z 2025-05-07T19:44:34.5061339Z 2025-05-07T19:44:34.5061351Z 2025-05-07T19:44:34.5061362Z 2025-05-07T19:44:34.5061373Z 2025-05-07T19:44:34.5061384Z 2025-05-07T19:44:34.5061396Z 2025-05-07T19:44:34.5061456Z 2025-05-07T19:44:34.5061468Z 2025-05-07T19:44:34.5061478Z 2025-05-07T19:44:34.5062295Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:34.5063159Z 2025-05-07T19:44:34.5063171Z 2025-05-07T19:44:34.5063183Z 2025-05-07T19:44:34.5063195Z 2025-05-07T19:44:34.5063209Z 2025-05-07T19:44:34.5063220Z 2025-05-07T19:44:34.5063230Z 2025-05-07T19:44:34.5063241Z 2025-05-07T19:44:34.5063251Z 2025-05-07T19:44:34.5063261Z 2025-05-07T19:44:34.5924781Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:34.5925719Z 2025-05-07T19:44:34.6885621Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:34.7219525Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:34.7220338Z 2025-05-07T19:44:34.7220352Z 2025-05-07T19:44:35.2342813Z libstdcxx-devel_linu | 11.1 MB | 
########## | 100%  2025-05-07T19:44:35.2346113Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:35.2346544Z 2025-05-07T19:44:35.2346766Z 2025-05-07T19:44:35.2346973Z  2025-05-07T19:44:35.2347222Z 2025-05-07T19:44:35.2347226Z 2025-05-07T19:44:35.2347444Z  2025-05-07T19:44:35.2347666Z 2025-05-07T19:44:35.2347669Z 2025-05-07T19:44:35.2347673Z 2025-05-07T19:44:35.2347943Z  2025-05-07T19:44:35.2348173Z 2025-05-07T19:44:35.2348177Z 2025-05-07T19:44:35.2348181Z 2025-05-07T19:44:35.2348184Z 2025-05-07T19:44:35.2348370Z  2025-05-07T19:44:35.2348627Z 2025-05-07T19:44:35.2348630Z 2025-05-07T19:44:35.2348634Z 2025-05-07T19:44:35.2348637Z 2025-05-07T19:44:35.2348641Z 2025-05-07T19:44:35.2348827Z  2025-05-07T19:44:35.2349089Z 2025-05-07T19:44:35.2349097Z 2025-05-07T19:44:35.2349100Z 2025-05-07T19:44:35.2349104Z 2025-05-07T19:44:35.2349107Z 2025-05-07T19:44:35.2349111Z 2025-05-07T19:44:35.2349305Z  2025-05-07T19:44:35.2349538Z 2025-05-07T19:44:35.2349541Z 2025-05-07T19:44:35.2349545Z 2025-05-07T19:44:35.2349573Z 2025-05-07T19:44:35.2349577Z 2025-05-07T19:44:35.2349580Z 2025-05-07T19:44:35.2349584Z 2025-05-07T19:44:35.2349780Z  2025-05-07T19:44:35.2350014Z 2025-05-07T19:44:35.2350017Z 2025-05-07T19:44:35.2350021Z 2025-05-07T19:44:35.2350024Z 2025-05-07T19:44:35.2350027Z 2025-05-07T19:44:35.2350031Z 2025-05-07T19:44:35.2350060Z 2025-05-07T19:44:35.2350063Z 2025-05-07T19:44:35.2350261Z  2025-05-07T19:44:35.2350497Z 2025-05-07T19:44:35.2350500Z 2025-05-07T19:44:35.2350504Z 2025-05-07T19:44:35.2350765Z 2025-05-07T19:44:35.2350769Z 2025-05-07T19:44:35.2350772Z 2025-05-07T19:44:35.2350776Z 2025-05-07T19:44:35.2350780Z 2025-05-07T19:44:35.2350813Z 2025-05-07T19:44:35.2351028Z  2025-05-07T19:44:35.2351273Z 2025-05-07T19:44:35.2351277Z 2025-05-07T19:44:35.2351281Z 2025-05-07T19:44:35.2351284Z 2025-05-07T19:44:35.2351288Z 2025-05-07T19:44:35.2351291Z 2025-05-07T19:44:35.2351295Z 2025-05-07T19:44:35.2352673Z 2025-05-07T19:44:35.2352679Z 2025-05-07T19:44:35.2352709Z 
2025-05-07T19:44:35.2352947Z  2025-05-07T19:44:35.2353195Z 2025-05-07T19:44:35.2353199Z 2025-05-07T19:44:35.2353202Z 2025-05-07T19:44:35.2353206Z 2025-05-07T19:44:35.2353210Z 2025-05-07T19:44:35.2353213Z 2025-05-07T19:44:35.2353216Z 2025-05-07T19:44:35.2353220Z 2025-05-07T19:44:35.2353252Z 2025-05-07T19:44:35.2353256Z 2025-05-07T19:44:35.2353259Z 2025-05-07T19:44:35.2353484Z  done 2025-05-07T19:44:35.3361286Z Preparing transaction: \ done 2025-05-07T19:44:35.6375300Z Verifying transaction: / - \ done 2025-05-07T19:44:35.7391026Z Executing transaction: / done 2025-05-07T19:44:35.8300409Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:39.5721895Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:39.5723648Z 2025-05-07T19:44:39.5734585Z 2025-05-07T19:44:39.5752757Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:39.5753383Z 2025-05-07T19:44:39.5771460Z 2025-05-07T19:44:39.5798319Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:39.5799591Z 2025-05-07T19:44:39.5818136Z 2025-05-07T19:44:39.5841358Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:39.5843086Z 2025-05-07T19:44:39.5856224Z 2025-05-07T19:44:39.5866134Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 
2025-05-07T19:44:39.5895604Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:40.3081838Z Channels: 2025-05-07T19:44:40.3082506Z - conda-forge 2025-05-07T19:44:40.3083148Z Platform: linux-64 2025-05-07T19:44:43.3958774Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:44.7496421Z Solving environment: \ | / done 2025-05-07T19:44:44.8081496Z 2025-05-07T19:44:44.8082042Z ## Package Plan ## 2025-05-07T19:44:44.8082508Z 2025-05-07T19:44:44.8083123Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:44.8084112Z 2025-05-07T19:44:44.8084381Z added / updated specs: 2025-05-07T19:44:44.8085109Z - clangxx=16.0.6 2025-05-07T19:44:44.8085792Z - compiler-rt=16.0.6 2025-05-07T19:44:44.8086520Z - libcxx 2025-05-07T19:44:44.8086757Z - llvm-openmp=16.0.6 2025-05-07T19:44:44.8086918Z 2025-05-07T19:44:44.8086922Z 2025-05-07T19:44:44.8087050Z The following packages will be downloaded: 2025-05-07T19:44:44.8087294Z 2025-05-07T19:44:44.8087429Z package | build 2025-05-07T19:44:44.8087766Z ---------------------------|----------------- 2025-05-07T19:44:44.8088169Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:44.8088652Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:44.8089110Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:44.8089585Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:44.8090409Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:44.8090876Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:44.8091328Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:44.8091814Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:44.8092542Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:44.8093082Z libiconv-1.18 | h4ce23a2_1 696 KB 
conda-forge 2025-05-07T19:44:44.8093507Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:44.8093909Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:44.8094320Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:44.8094759Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:44.8095171Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:44.8095745Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:44.8096257Z ------------------------------------------------------------ 2025-05-07T19:44:44.8096635Z Total: 142.6 MB 2025-05-07T19:44:44.8097039Z 2025-05-07T19:44:44.8097179Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:44.8097517Z 2025-05-07T19:44:44.8097750Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:44.8098270Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.8098782Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 2025-05-07T19:44:44.8099316Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.8099872Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.8100393Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:44.8100920Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.8101450Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:44.8101932Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:44.8102414Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:44.8102899Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:44.8103556Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:44.8103995Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.8104481Z llvm-openmp 
conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:44.8104971Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:44.8105237Z 2025-05-07T19:44:44.8105356Z The following packages will be UPDATED: 2025-05-07T19:44:44.8105563Z 2025-05-07T19:44:44.8105818Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.8106158Z 2025-05-07T19:44:44.8106161Z 2025-05-07T19:44:44.8106165Z 2025-05-07T19:44:44.8106313Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:44.8106712Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:44.8106954Z 2025-05-07T19:44:44.8107325Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:44.8107576Z 2025-05-07T19:44:44.8107580Z 2025-05-07T19:44:44.8107792Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:44.8108062Z 2025-05-07T19:44:44.8108066Z 2025-05-07T19:44:44.8108070Z 2025-05-07T19:44:44.8108305Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:44.8108684Z 2025-05-07T19:44:44.8108688Z 2025-05-07T19:44:44.8108691Z 2025-05-07T19:44:44.8108694Z 2025-05-07T19:44:44.8117922Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:44.8118398Z 2025-05-07T19:44:44.8118403Z 2025-05-07T19:44:44.8118407Z 2025-05-07T19:44:44.8118411Z 2025-05-07T19:44:44.8118415Z 2025-05-07T19:44:44.8118683Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:44.8119176Z 2025-05-07T19:44:44.8119182Z 2025-05-07T19:44:44.8119186Z 2025-05-07T19:44:44.8119189Z 2025-05-07T19:44:44.8119193Z 2025-05-07T19:44:44.8119197Z 2025-05-07T19:44:44.8119453Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:44.8119745Z 2025-05-07T19:44:44.8119748Z 2025-05-07T19:44:44.8119752Z 2025-05-07T19:44:44.8119756Z 2025-05-07T19:44:44.8119759Z 2025-05-07T19:44:44.8119763Z 2025-05-07T19:44:44.8119767Z 2025-05-07T19:44:44.8120715Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:44.8121061Z 2025-05-07T19:44:44.8121066Z 2025-05-07T19:44:44.8121070Z 2025-05-07T19:44:44.8121075Z 2025-05-07T19:44:44.8121080Z 
2025-05-07T19:44:44.8121084Z 2025-05-07T19:44:44.8121089Z 2025-05-07T19:44:44.8121108Z 2025-05-07T19:44:44.8121386Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:44.8121667Z 2025-05-07T19:44:44.8121671Z 2025-05-07T19:44:44.8121691Z 2025-05-07T19:44:44.8121709Z 2025-05-07T19:44:44.8121713Z 2025-05-07T19:44:44.8121717Z 2025-05-07T19:44:44.8121720Z 2025-05-07T19:44:44.8121724Z 2025-05-07T19:44:44.8121728Z 2025-05-07T19:44:44.8123430Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:44.8123781Z 2025-05-07T19:44:44.8123787Z 2025-05-07T19:44:44.8123791Z 2025-05-07T19:44:44.8123796Z 2025-05-07T19:44:44.8123800Z 2025-05-07T19:44:44.8123805Z 2025-05-07T19:44:44.8123809Z 2025-05-07T19:44:44.8123813Z 2025-05-07T19:44:44.8123840Z 2025-05-07T19:44:44.8123844Z 2025-05-07T19:44:44.8124122Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:44.8124440Z 2025-05-07T19:44:44.8124444Z 2025-05-07T19:44:44.8124448Z 2025-05-07T19:44:44.8124453Z 2025-05-07T19:44:44.8124457Z 2025-05-07T19:44:44.8124461Z 2025-05-07T19:44:44.8124465Z 2025-05-07T19:44:44.8124469Z 2025-05-07T19:44:44.8124473Z 2025-05-07T19:44:44.8124477Z 2025-05-07T19:44:44.8124494Z 2025-05-07T19:44:44.8124943Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:44.8125245Z 2025-05-07T19:44:44.8125249Z 2025-05-07T19:44:44.8125263Z 2025-05-07T19:44:44.8125267Z 2025-05-07T19:44:44.8125270Z 2025-05-07T19:44:44.8125274Z 2025-05-07T19:44:44.8125277Z 2025-05-07T19:44:44.8125280Z 2025-05-07T19:44:44.8125284Z 2025-05-07T19:44:44.8125287Z 2025-05-07T19:44:44.8125291Z 2025-05-07T19:44:44.8125294Z 2025-05-07T19:44:44.8126088Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:44.8126386Z 2025-05-07T19:44:44.8126401Z 2025-05-07T19:44:44.8126405Z 2025-05-07T19:44:44.8126408Z 2025-05-07T19:44:44.8126412Z 2025-05-07T19:44:44.8126415Z 2025-05-07T19:44:44.8126419Z 2025-05-07T19:44:44.8126422Z 2025-05-07T19:44:44.8126426Z 2025-05-07T19:44:44.8126429Z 2025-05-07T19:44:44.8126433Z 2025-05-07T19:44:44.8126436Z 2025-05-07T19:44:44.8126459Z 
2025-05-07T19:44:44.8127176Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:44.8127484Z 2025-05-07T19:44:44.8127502Z 2025-05-07T19:44:44.8127506Z 2025-05-07T19:44:44.8127509Z 2025-05-07T19:44:44.8127513Z 2025-05-07T19:44:44.8127517Z 2025-05-07T19:44:44.8127520Z 2025-05-07T19:44:44.8127548Z 2025-05-07T19:44:44.8127551Z 2025-05-07T19:44:44.8127554Z 2025-05-07T19:44:44.8127558Z 2025-05-07T19:44:44.8127562Z 2025-05-07T19:44:44.8127565Z 2025-05-07T19:44:44.8127568Z 2025-05-07T19:44:44.8128238Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:44.8128718Z 2025-05-07T19:44:44.8128741Z 2025-05-07T19:44:44.8128745Z 2025-05-07T19:44:44.8128748Z 2025-05-07T19:44:44.8128752Z 2025-05-07T19:44:44.8128756Z 2025-05-07T19:44:44.8128759Z 2025-05-07T19:44:44.8128763Z 2025-05-07T19:44:44.8128766Z 2025-05-07T19:44:44.8128782Z 2025-05-07T19:44:44.8128785Z 2025-05-07T19:44:44.8128788Z 2025-05-07T19:44:44.8128792Z 2025-05-07T19:44:44.8128891Z 2025-05-07T19:44:44.8128896Z 2025-05-07T19:44:44.9411065Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:44.9412039Z 2025-05-07T19:44:44.9412052Z 2025-05-07T19:44:44.9628357Z 2025-05-07T19:44:45.0665779Z libclang-cpp16-16.0. | 17.3 MB | 1 | 1%  2025-05-07T19:44:45.0666645Z 2025-05-07T19:44:45.0666661Z 2025-05-07T19:44:45.0666685Z 2025-05-07T19:44:45.1586167Z libclang-cpp16-16.0. | 17.3 MB | 2 | 2%  2025-05-07T19:44:45.1586515Z 2025-05-07T19:44:45.1586520Z 2025-05-07T19:44:45.1586524Z 2025-05-07T19:44:45.1586528Z 2025-05-07T19:44:45.1606341Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:45.1645142Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:45.1645944Z 2025-05-07T19:44:45.1664541Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:45.1665307Z 2025-05-07T19:44:45.1665321Z 2025-05-07T19:44:45.1665368Z 2025-05-07T19:44:45.1735271Z libclang-cpp16-16.0. 
| 17.3 MB | ##5 | 25%  2025-05-07T19:44:45.1735616Z 2025-05-07T19:44:45.1735621Z 2025-05-07T19:44:45.2585188Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:45.2585469Z 2025-05-07T19:44:45.2585474Z 2025-05-07T19:44:45.2606517Z 2025-05-07T19:44:45.2606558Z 2025-05-07T19:44:45.2606927Z icu-73.2 | 11.5 MB | #######2 | 72%  2025-05-07T19:44:45.2646833Z llvm-openmp-16.0.6 | 39.9 MB | #9 | 20% 2025-05-07T19:44:45.2647160Z 2025-05-07T19:44:45.2665672Z compiler-rt_linux-64 | 36.0 MB | ##3 | 24%  2025-05-07T19:44:45.2666489Z 2025-05-07T19:44:45.2666503Z 2025-05-07T19:44:45.2666514Z 2025-05-07T19:44:45.2739657Z libclang-cpp16-16.0. | 17.3 MB | ####### | 71%  2025-05-07T19:44:45.2739988Z 2025-05-07T19:44:45.2739993Z 2025-05-07T19:44:45.3607373Z libllvm16-16.0.6 | 33.7 MB | #5 | 16%  2025-05-07T19:44:45.3646346Z llvm-openmp-16.0.6 | 39.9 MB | ####8 | 48% 2025-05-07T19:44:45.3646869Z 2025-05-07T19:44:45.3745060Z compiler-rt_linux-64 | 36.0 MB | ####6 | 47%  2025-05-07T19:44:45.3745365Z 2025-05-07T19:44:45.3745369Z 2025-05-07T19:44:45.3837700Z libllvm16-16.0.6 | 33.7 MB | ####3 | 44%  2025-05-07T19:44:45.3837993Z 2025-05-07T19:44:45.3838150Z 2025-05-07T19:44:45.3838158Z 2025-05-07T19:44:45.3838163Z 2025-05-07T19:44:45.4246249Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:45.4246816Z 2025-05-07T19:44:45.4247010Z 2025-05-07T19:44:45.4247016Z 2025-05-07T19:44:45.4247047Z 2025-05-07T19:44:45.4247051Z 2025-05-07T19:44:45.4298797Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:45.4299103Z 2025-05-07T19:44:45.4299108Z 2025-05-07T19:44:45.4299112Z 2025-05-07T19:44:45.4299361Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:45.4299652Z 2025-05-07T19:44:45.4299670Z 2025-05-07T19:44:45.4299674Z 2025-05-07T19:44:45.4480334Z libclang-cpp16-16.0. 
| 17.3 MB | ########## | 100%  2025-05-07T19:44:45.4481555Z 2025-05-07T19:44:45.4481595Z 2025-05-07T19:44:45.4481607Z 2025-05-07T19:44:45.4481619Z 2025-05-07T19:44:45.4481630Z 2025-05-07T19:44:45.4687305Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:45.4687613Z 2025-05-07T19:44:45.4687618Z 2025-05-07T19:44:45.4687621Z 2025-05-07T19:44:45.4687643Z 2025-05-07T19:44:45.4687867Z 2025-05-07T19:44:45.4687871Z 2025-05-07T19:44:45.4746616Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:45.4747476Z 2025-05-07T19:44:45.4747490Z 2025-05-07T19:44:45.4876571Z libllvm16-16.0.6 | 33.7 MB | ######4 | 65%  2025-05-07T19:44:45.4876861Z 2025-05-07T19:44:45.4876909Z 2025-05-07T19:44:45.4876913Z 2025-05-07T19:44:45.4876943Z 2025-05-07T19:44:45.4876947Z 2025-05-07T19:44:45.4876951Z 2025-05-07T19:44:45.4877295Z 2025-05-07T19:44:45.4966914Z libiconv-1.18 | 696 KB | 2 | 2%  2025-05-07T19:44:45.4999385Z llvm-openmp-16.0.6 | 39.9 MB | ######7 | 67% 2025-05-07T19:44:45.5000187Z 2025-05-07T19:44:45.5000202Z 2025-05-07T19:44:45.5000214Z 2025-05-07T19:44:45.5000225Z 2025-05-07T19:44:45.5000235Z 2025-05-07T19:44:45.5000264Z 2025-05-07T19:44:45.5024601Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:45.5025461Z 2025-05-07T19:44:45.5146657Z compiler-rt_linux-64 | 36.0 MB | ######4 | 65%  2025-05-07T19:44:45.5147508Z 2025-05-07T19:44:45.5147521Z 2025-05-07T19:44:45.5147531Z 2025-05-07T19:44:45.5147541Z 2025-05-07T19:44:45.5147552Z 2025-05-07T19:44:45.5147562Z 2025-05-07T19:44:45.5147573Z 2025-05-07T19:44:45.5380185Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:45.5381095Z 2025-05-07T19:44:45.5381109Z 2025-05-07T19:44:45.5381120Z 2025-05-07T19:44:45.5381131Z 2025-05-07T19:44:45.5381173Z 2025-05-07T19:44:45.5381184Z 2025-05-07T19:44:45.5381194Z 2025-05-07T19:44:45.5381204Z 2025-05-07T19:44:45.5495520Z libxml2-2.12.7 | 688 KB | 2 | 2%  2025-05-07T19:44:45.5495840Z 2025-05-07T19:44:45.5496361Z 2025-05-07T19:44:45.5496370Z 2025-05-07T19:44:45.5496375Z 
2025-05-07T19:44:45.5979354Z libcxx-19.1.7 | 1000 KB | ########## | 100% 2025-05-07T19:44:45.8764751Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:45.9159488Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:45.9259039Z icu-73.2 | 11.5 MB | ########## | 100% 2025-05-07T19:44:45.9290340Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:45.9501330Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:45.9820076Z libcxxabi-19.1.7 | 158 KB | ########## | 100% 2025-05-07T19:44:46.0257640Z clang-16.0.6 | 110 KB | ########## | 100%
2025-05-07T19:44:46.0419566Z clangxx-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:46.0508195Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:46.0538709Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:46.0855651Z libzlib-1.2.13 | 60 KB | ########## | 100% 2025-05-07T19:44:46.4595969Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100% 2025-05-07T19:44:46.5063100Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:46.5744435Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:46.5745832Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100%
2025-05-07T19:44:46.5754156Z done 2025-05-07T19:44:46.6759661Z Preparing transaction: \ done 2025-05-07T19:44:46.7768078Z Verifying transaction: / done 2025-05-07T19:44:46.8782696Z Executing transaction: \ done 2025-05-07T19:44:46.9695453Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:50.7558588Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:50.7559193Z 2025-05-07T19:44:50.7571634Z 2025-05-07T19:44:50.7586685Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:50.7587178Z 2025-05-07T19:44:50.7603026Z 2025-05-07T19:44:50.7625805Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:50.7626336Z 2025-05-07T19:44:50.7644014Z 2025-05-07T19:44:50.7665601Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:50.7666122Z 2025-05-07T19:44:50.7678988Z 2025-05-07T19:44:50.7680578Z [INSTALL] Removing GCC package activation scripts ... 2025-05-07T19:44:52.6508362Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:44:52.6508756Z 2025-05-07T19:44:52.6534614Z total 28 2025-05-07T19:44:52.6535414Z drwxr-xr-x. 2 root root 134 May 7 19:44 . 2025-05-07T19:44:52.6536649Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:44:52.6537855Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:44:52.6539385Z -rw-r--r--.
2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:44:52.6539828Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:44:52.6540271Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:44:52.6540534Z 2025-05-07T19:44:52.6540871Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gcc_linux-64.sh 2025-05-07T19:44:52.6541291Z 2025-05-07T19:44:52.6562549Z 2025-05-07T19:44:52.6563718Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gxx_linux-64.sh 2025-05-07T19:44:52.6565028Z 2025-05-07T19:44:52.6587338Z 2025-05-07T19:44:52.6588733Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:52.6589484Z 2025-05-07T19:44:53.0741517Z 2025-05-07T19:44:53.0742035Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:53.0742323Z 2025-05-07T19:44:53.5016593Z 2025-05-07T19:44:53.5016941Z + conda run -n build_binary printenv CC 2025-05-07T19:44:53.5017529Z 2025-05-07T19:44:55.0808847Z 2025-05-07T19:44:55.0808929Z 2025-05-07T19:44:55.1379743Z 2025-05-07T19:44:55.1380221Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:55.1380522Z 2025-05-07T19:44:56.7378681Z 2025-05-07T19:44:56.7378696Z 2025-05-07T19:44:56.8022330Z 2025-05-07T19:44:58.4406903Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:45:00.0222830Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. 
(See above for error) 2025-05-07T19:45:00.0794010Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:45:00.0795321Z 2025-05-07T19:45:00.4899432Z 2025-05-07T19:45:02.0739326Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:45:02.1320006Z 2025-05-07T19:45:02.1320242Z [CHECK] Binary cc found in PATH 2025-05-07T19:45:03.7346207Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:45:03.7347266Z 2025-05-07T19:45:03.7913334Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:05.4034032Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:05.4034596Z 2025-05-07T19:45:05.4842320Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:07.0766947Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:07.0767631Z 2025-05-07T19:45:07.1353383Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:07.1353982Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:07.1354555Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:07.1354774Z 2025-05-07T19:45:08.7421026Z #define _LP64 1 2025-05-07T19:45:08.7421785Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:08.7422551Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:08.7423536Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:08.7424795Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:08.7425553Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:08.7426310Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:08.7427034Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:08.7427847Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:08.7428408Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:08.7428706Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:08.7429042Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:08.7429362Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:08.7429657Z #define __CHAR_BIT__ 8 2025-05-07T19:45:08.7429936Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 
2025-05-07T19:45:08.7430258Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.7430610Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.7430953Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.7431266Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.7431602Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.7431918Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.7432257Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.7432585Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.7432926Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.7433244Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:08.7433688Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:08.7434111Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:08.7434413Z #define __DBL_DIG__ 15 2025-05-07T19:45:08.7434683Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:08.7434980Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:08.7435249Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.7435505Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.7435772Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:08.7436018Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:08.7436294Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:08.7436560Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:08.7436873Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:08.7437156Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:08.7437420Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:08.7437741Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:08.7438027Z #define __ELF__ 1 2025-05-07T19:45:08.7438269Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:08.7438520Z #define __FLOAT128__ 1 2025-05-07T19:45:08.7438772Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:08.7439068Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:08.7439397Z 
#define __FLT16_DIG__ 3 2025-05-07T19:45:08.7439671Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:08.7439958Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:08.7440237Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:08.7440507Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.7440794Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:08.7441044Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:08.7441310Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:08.7441579Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:08.7441843Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:08.7442120Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:08.7442374Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:08.7442664Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:08.7442928Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:08.7443226Z #define __FLT_DIG__ 6 2025-05-07T19:45:08.7443461Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:08.7443930Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:08.7444186Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:08.7444465Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.7444739Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:08.7444988Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:08.7445259Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:08.7445694Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:08.7446130Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:08.7446401Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:08.7446683Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:08.7446950Z #define __FLT_RADIX__ 2 2025-05-07T19:45:08.7447210Z #define __FXSR__ 1 2025-05-07T19:45:08.7447443Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:08.7447753Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.7448073Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.7448389Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.7448722Z #define 
__GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.7449015Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.7449324Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.7449623Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.7449937Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.7450244Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.7450567Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:08.7450902Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.7451317Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:08.7451627Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:08.7451938Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:08.7452269Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:08.7452577Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:08.7452877Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:08.7453116Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:08.7453392Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:08.7453636Z #define __GNUC__ 4 2025-05-07T19:45:08.7453867Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:08.7454136Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:08.7454376Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:08.7454642Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:08.7454888Z #define __INT16_MAX__ 32767 2025-05-07T19:45:08.7455257Z #define __INT16_TYPE__ short 2025-05-07T19:45:08.7455510Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:08.7455762Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:08.7455997Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:08.7456364Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:08.7456806Z #define __INT32_TYPE__ int 2025-05-07T19:45:08.7457086Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:08.7457461Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:08.7457720Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:08.7458010Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7458317Z 
#define __INT64_TYPE__ long int 2025-05-07T19:45:08.7458611Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:08.7458867Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:08.7459151Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:08.7459407Z #define __INT8_MAX__ 127 2025-05-07T19:45:08.7459681Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:08.7459967Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:08.7460259Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:08.7460541Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:08.7460825Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7461153Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:08.7461429Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:08.7461706Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:08.7461971Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:08.7462266Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7462579Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:08.7462879Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:08.7463144Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:08.7463554Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:08.7463853Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:08.7464129Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:08.7464431Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:08.7464704Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:08.7464996Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:08.7465273Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:08.7465653Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:08.7465922Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:08.7466208Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:08.7466486Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:08.7466797Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7467140Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:08.7467424Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:08.7467713Z #define __INT_FAST8_FMTd__ 
"hhd" 2025-05-07T19:45:08.7467987Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:08.7468282Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:08.7468560Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:08.7468877Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:08.7469153Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:08.7469461Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:08.7469743Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:08.7470046Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:08.7470571Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:08.7470951Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:08.7471243Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:08.7471523Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:08.7471833Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:08.7472109Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:08.7472401Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:08.7472683Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:08.7473004Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7473342Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:08.7473654Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:08.7473953Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:08.7474237Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:08.7474537Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:08.7474817Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:08.7475134Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:08.7475402Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:08.7475685Z #define __INT_WIDTH__ 32 2025-05-07T19:45:08.7475938Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:08.7476277Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:08.7476637Z #define __LDBL_DIG__ 18 2025-05-07T19:45:08.7476920Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:08.7477266Z #define __LDBL_HAS_DENORM__ 1 
2025-05-07T19:45:08.7477539Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.7477834Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.7478116Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:08.7478397Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:08.7478674Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:08.7478987Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:08.7479320Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:08.7479633Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:08.7479957Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:08.7480284Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:08.7480560Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:08.7480837Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:08.7481178Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7481473Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:08.7481731Z #define __LP64__ 1 2025-05-07T19:45:08.7481948Z #define __MMX__ 1 2025-05-07T19:45:08.7482185Z #define __NO_INLINE__ 1 2025-05-07T19:45:08.7482429Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:08.7482831Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:08.7483305Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:08.7483638Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:08.7483967Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:08.7484289Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:08.7484620Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:08.7484926Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:08.7485321Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:08.7485613Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:08.7485897Z #define __PIC__ 2 2025-05-07T19:45:08.7486128Z #define __PIE__ 2 2025-05-07T19:45:08.7486356Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:08.7486652Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:08.7486954Z #define __PTRDIFF_FMTd__ "ld" 
2025-05-07T19:45:08.7487221Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:08.7487516Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:08.7487822Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:08.7488120Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:08.7488383Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:08.7488767Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:08.7488995Z #define __SEG_FS 1 2025-05-07T19:45:08.7489217Z #define __SEG_GS 1 2025-05-07T19:45:08.7489430Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:08.7489681Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:08.7489939Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:08.7490216Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:08.7490488Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:08.7490733Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:08.7490998Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:08.7491236Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:08.7491494Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:08.7491734Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:08.7492014Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:08.7492260Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:08.7492525Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:08.7492821Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:08.7493085Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:08.7493362Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:08.7493615Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:08.7493898Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:08.7494152Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:08.7494437Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:08.7494686Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:08.7494954Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:08.7495201Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7495530Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:08.7495821Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:08.7496168Z #define __SSE2_MATH__ 1 
2025-05-07T19:45:08.7496409Z #define __SSE2__ 1 2025-05-07T19:45:08.7496615Z #define __SSE_MATH__ 1 2025-05-07T19:45:08.7497042Z #define __SSE__ 1 2025-05-07T19:45:08.7497363Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:08.7497631Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:08.7497881Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:08.7498154Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:08.7498421Z #define __STDC__ 1 2025-05-07T19:45:08.7498666Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:08.7498929Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:08.7499208Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:08.7499489Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:08.7499749Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:08.7500029Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:08.7500308Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:08.7500629Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:08.7500895Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:08.7501170Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:08.7501426Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:08.7501702Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:08.7501966Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:08.7502272Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:08.7502720Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:08.7502993Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:08.7503277Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:08.7503540Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:08.7503823Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:08.7504118Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7504489Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:08.7504812Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:08.7505190Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:08.7505472Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:08.7505776Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:08.7506078Z #define __UINT8_FMTx__ "hhx" 
2025-05-07T19:45:08.7506353Z #define __UINT8_MAX__ 255 2025-05-07T19:45:08.7506663Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:08.7506978Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:08.7507297Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:08.7507585Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:08.7507900Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:08.7508182Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:08.7508510Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7508861Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:08.7509321Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:08.7509621Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:08.7509899Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:08.7510204Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:08.7510483Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:08.7510803Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7511140Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:08.7511480Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:08.7511758Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:08.7512077Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:08.7512375Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:08.7512687Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:08.7513019Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:08.7513334Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:08.7513682Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:08.7513974Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:08.7514281Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:08.7514564Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:08.7514884Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:08.7515209Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:08.7515549Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:08.7515837Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:08.7516156Z 
#define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:08.7516589Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:08.7516893Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7517260Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.7517581Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:08.7517886Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:08.7518169Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:08.7518476Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:08.7518754Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:08.7519059Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:08.7519393Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:08.7519677Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:08.7519984Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:08.7520271Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:08.7520579Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:08.7520873Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:08.7521213Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:08.7521495Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:08.7521798Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:08.7522079Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:08.7522392Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:08.7522736Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:08.7523135Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:08.7523450Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:08.7523736Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:08.7524041Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:08.7524342Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.7524733Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.7525107Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:08.7525398Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:08.7525666Z 
#define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:08.7525952Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:08.7526220Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:08.7526512Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:08.7526810Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:08.7527413Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.7528019Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:08.7528282Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:08.7528552Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:08.7528803Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:08.7529091Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:08.7529364Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:08.7529637Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:08.7529872Z #define __amd64 1 2025-05-07T19:45:08.7530105Z #define __amd64__ 1 2025-05-07T19:45:08.7530336Z #define __clang__ 1 2025-05-07T19:45:08.7530580Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:08.7530891Z #define __clang_major__ 16 2025-05-07T19:45:08.7531131Z #define __clang_minor__ 0 2025-05-07T19:45:08.7531403Z #define __clang_patchlevel__ 6 2025-05-07T19:45:08.7531960Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.7532588Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:08.7532905Z #define __code_model_small__ 1 2025-05-07T19:45:08.7533175Z #define __gnu_linux__ 1 2025-05-07T19:45:08.7533406Z #define __k8 1 2025-05-07T19:45:08.7533600Z #define __k8__ 1 2025-05-07T19:45:08.7533817Z #define __linux 1 2025-05-07T19:45:08.7534021Z #define __linux__ 1 2025-05-07T19:45:08.7534242Z #define __llvm__ 1 2025-05-07T19:45:08.7534443Z #define __pic__ 2 2025-05-07T19:45:08.7534664Z #define __pie__ 2 2025-05-07T19:45:08.7534918Z #define __seg_fs 
__attribute__((address_space(257))) 2025-05-07T19:45:08.7535286Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:08.7535594Z #define __tune_k8__ 1 2025-05-07T19:45:08.7535831Z #define __unix 1 2025-05-07T19:45:08.7536111Z #define __unix__ 1 2025-05-07T19:45:08.7536352Z #define __x86_64 1 2025-05-07T19:45:08.7536763Z #define __x86_64__ 1 2025-05-07T19:45:08.7536986Z #define linux 1 2025-05-07T19:45:08.7537223Z #define unix 1 2025-05-07T19:45:08.7537353Z 2025-05-07T19:45:08.8009540Z 2025-05-07T19:45:08.8010428Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:08.8011773Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:08.8012465Z 2025-05-07T19:45:10.4092791Z #define _GNU_SOURCE 1 2025-05-07T19:45:10.4093806Z #define _LP64 1 2025-05-07T19:45:10.4094766Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:10.4095823Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:10.4097103Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:10.4098443Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:10.4099334Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:10.4099771Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:10.4100033Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:10.4100336Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:10.4100621Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:10.4100919Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:10.4101250Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:10.4101566Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:10.4102078Z #define __CHAR_BIT__ 8 2025-05-07T19:45:10.4102350Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.4102666Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.4103009Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.4103345Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.4103797Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.4104124Z #define 
__CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.4106484Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:10.4106814Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.4107115Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.4107436Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.4107734Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:10.4108023Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:10.4108321Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:10.4108619Z #define __DBL_DIG__ 15 2025-05-07T19:45:10.4108888Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:10.4109184Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:10.4109450Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.4109699Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.4109972Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:10.4110219Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:10.4110486Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:10.4110747Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:10.4111051Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:10.4111331Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:10.4111591Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:10.4111907Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:10.4112192Z #define __DEPRECATED 1 2025-05-07T19:45:10.4112427Z #define __ELF__ 1 2025-05-07T19:45:10.4112634Z #define __EXCEPTIONS 1 2025-05-07T19:45:10.4112888Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:10.4113136Z #define __FLOAT128__ 1 2025-05-07T19:45:10.4113391Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:10.4113679Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:10.4114003Z #define __FLT16_DIG__ 3 2025-05-07T19:45:10.4114259Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:10.4114542Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:10.4114818Z #define __FLT16_HAS_INFINITY__ 1 
2025-05-07T19:45:10.4115087Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.4115367Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:10.4115623Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:10.4115898Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:10.4116145Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:10.4116424Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:10.4116683Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:10.4116954Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:10.4117243Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:10.4117502Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:10.4117800Z #define __FLT_DIG__ 6 2025-05-07T19:45:10.4118031Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:10.4118320Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:10.4118568Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:10.4118837Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.4119086Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:10.4119342Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:10.4119588Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:10.4119852Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:10.4120134Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:10.4120391Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:10.4120661Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:10.4120919Z #define __FLT_RADIX__ 2 2025-05-07T19:45:10.4121157Z #define __FXSR__ 1 2025-05-07T19:45:10.4121379Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:10.4121670Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.4121958Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.4122384Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.4122696Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.4122975Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.4123279Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.4123568Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 
2025-05-07T19:45:10.4123878Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.4124175Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.4124567Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:10.4124872Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.4125178Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:10.4125471Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:10.4125806Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:10.4126135Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:10.4126446Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:10.4126762Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:10.4127046Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:10.4127343Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:10.4127595Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:10.4127852Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:10.4128095Z #define __GNUC__ 4 2025-05-07T19:45:10.4128319Z #define __GNUG__ 4 2025-05-07T19:45:10.4128540Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:10.4128824Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:10.4129109Z #define __GXX_RTTI 1 2025-05-07T19:45:10.4129328Z #define __GXX_WEAK__ 1 2025-05-07T19:45:10.4129571Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:10.4129814Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:10.4130069Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:10.4130304Z #define __INT16_MAX__ 32767 2025-05-07T19:45:10.4130555Z #define __INT16_TYPE__ short 2025-05-07T19:45:10.4130797Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:10.4131047Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:10.4131284Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:10.4131535Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:10.4131802Z #define __INT32_TYPE__ int 2025-05-07T19:45:10.4132038Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:10.4132298Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:10.4132536Z #define 
__INT64_FMTi__ "li" 2025-05-07T19:45:10.4132801Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4133080Z #define __INT64_TYPE__ long int 2025-05-07T19:45:10.4133349Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:10.4133586Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:10.4133842Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:10.4134084Z #define __INT8_MAX__ 127 2025-05-07T19:45:10.4134349Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:10.4134639Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:10.4134891Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:10.4135159Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:10.4135415Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4135721Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:10.4135978Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:10.4136365Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:10.4136802Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:10.4137103Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4137486Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:10.4137789Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:10.4138072Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:10.4138354Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:10.4138649Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:10.4138927Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:10.4139230Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:10.4139499Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:10.4139797Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:10.4140071Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:10.4140378Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:10.4140669Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:10.4141053Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:10.4141354Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:10.4141657Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4142008Z #define __INT_FAST64_TYPE__ long 
int 2025-05-07T19:45:10.4142303Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:10.4142598Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:10.4142985Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:10.4143359Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:10.4143627Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:10.4143933Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:10.4144209Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:10.4144486Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:10.4144771Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:10.4145045Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:10.4145336Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:10.4145602Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:10.4145891Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:10.4146159Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:10.4146461Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:10.4146723Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:10.4147008Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:10.4147300Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:10.4147581Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4147906Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:10.4148177Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:10.4148453Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:10.4148715Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:10.4148992Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:10.4149251Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:10.4149549Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:10.4149797Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:10.4150054Z #define __INT_WIDTH__ 32 2025-05-07T19:45:10.4150310Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:10.4150610Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:10.4150947Z #define __LDBL_DIG__ 18 2025-05-07T19:45:10.4151211Z 
#define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:10.4151540Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:10.4151793Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.4152073Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.4152332Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:10.4152595Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:10.4152869Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:10.4153142Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:10.4153461Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:10.4153729Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:10.4154032Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:10.4154334Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:10.4154596Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:10.4154861Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:10.4155181Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4155456Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:10.4155704Z #define __LP64__ 1 2025-05-07T19:45:10.4155930Z #define __MMX__ 1 2025-05-07T19:45:10.4156134Z #define __NO_INLINE__ 1 2025-05-07T19:45:10.4156377Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:10.4156620Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:10.4156921Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:10.4157238Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:10.4157550Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:10.4157853Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:10.4158177Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:10.4158469Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:10.4158756Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:10.4159051Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:10.4159396Z #define __PIC__ 2 2025-05-07T19:45:10.4159625Z #define __PIE__ 2 2025-05-07T19:45:10.4159842Z #define __POINTER_WIDTH__ 64 
2025-05-07T19:45:10.4160118Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:10.4160392Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:10.4160663Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:10.4160925Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:10.4161230Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:10.4161562Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:10.4161827Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:10.4162086Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:10.4162311Z #define __SEG_FS 1 2025-05-07T19:45:10.4162533Z #define __SEG_GS 1 2025-05-07T19:45:10.4162740Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:10.4162996Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:10.4163238Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:10.4163524Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:10.4163774Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:10.4164037Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:10.4164286Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:10.4164541Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:10.4164807Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:10.4165053Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:10.4165338Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:10.4165590Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:10.4165841Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:10.4166095Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:10.4166357Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:10.4166593Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:10.4166851Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:10.4167098Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:10.4167355Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:10.4167596Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:10.4167851Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:10.4168108Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:10.4168362Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4168680Z #define __SIZE_TYPE__ long unsigned int 
2025-05-07T19:45:10.4168961Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:10.4169212Z #define __SSE2_MATH__ 1 2025-05-07T19:45:10.4169436Z #define __SSE2__ 1 2025-05-07T19:45:10.4169664Z #define __SSE_MATH__ 1 2025-05-07T19:45:10.4169885Z #define __SSE__ 1 2025-05-07T19:45:10.4170549Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:10.4170999Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:10.4171336Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:10.4171609Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:10.4171863Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:10.4172125Z #define __STDC__ 1 2025-05-07T19:45:10.4172362Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:10.4172645Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:10.4172908Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:10.4173191Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:10.4173456Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:10.4173745Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:10.4174028Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:10.4174357Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:10.4174653Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:10.4174916Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:10.4175190Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:10.4175442Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:10.4175722Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:10.4176010Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:10.4176408Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:10.4176671Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:10.4176952Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:10.4177208Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:10.4177509Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:10.4177809Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4178131Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:10.4178448Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:10.4178871Z #define __UINT8_FMTX__ "hhX" 
2025-05-07T19:45:10.4179152Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:10.4179417Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:10.4179699Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:10.4179961Z #define __UINT8_MAX__ 255 2025-05-07T19:45:10.4180242Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:10.4180535Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:10.4180826Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:10.4181199Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:10.4181467Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:10.4181753Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:10.4182044Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4182402Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:10.4182717Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:10.4183012Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:10.4183286Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:10.4183580Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:10.4183851Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:10.4184156Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4184504Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:10.4184812Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:10.4185098Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:10.4185383Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:10.4185680Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:10.4185965Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:10.4186259Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:10.4186551Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:10.4186884Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:10.4187161Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:10.4187454Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:10.4187743Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:10.4188022Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:10.4188352Z #define 
__UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:10.4188780Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:10.4189064Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:10.4189329Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:10.4189601Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:10.4189882Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4190227Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.4190542Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:10.4190804Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:10.4191078Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:10.4191340Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:10.4191618Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:10.4191882Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:10.4192184Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:10.4192451Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:10.4192730Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:10.4192994Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:10.4193275Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:10.4193573Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:10.4193874Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:10.4194158Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:10.4194424Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:10.4194711Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:10.4194979Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:10.4195290Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:10.4195577Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:10.4195859Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:10.4196125Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:10.4196405Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:10.4196711Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.4197046Z #define 
__UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.4197410Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:10.4197811Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:10.4198128Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:10.4198410Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:10.4198728Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:10.4199020Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:10.4199367Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:10.4200050Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.4200645Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:10.4200959Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:10.4201224Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:10.4201521Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:10.4201809Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:10.4202128Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:10.4202398Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:10.4202686Z #define __amd64 1 2025-05-07T19:45:10.4202919Z #define __amd64__ 1 2025-05-07T19:45:10.4203187Z #define __clang__ 1 2025-05-07T19:45:10.4203474Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:10.4203787Z #define __clang_major__ 16 2025-05-07T19:45:10.4204079Z #define __clang_minor__ 0 2025-05-07T19:45:10.4204348Z #define __clang_patchlevel__ 6 2025-05-07T19:45:10.4204950Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.4205582Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:10.4205943Z #define __code_model_small__ 1 2025-05-07T19:45:10.4206208Z #define __cplusplus 201703L 2025-05-07T19:45:10.4206498Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:10.4206827Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:10.4207124Z #define __cpp_alias_templates 200704L 
2025-05-07T19:45:10.4207431Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:10.4207703Z #define __cpp_attributes 200809L 2025-05-07T19:45:10.4208012Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:10.4208308Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:10.4208637Z #define __cpp_constexpr 201603L 2025-05-07T19:45:10.4208928Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:10.4209263Z #define __cpp_decltype 200707L 2025-05-07T19:45:10.4209565Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:10.4209852Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:10.4210207Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:10.4210540Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:10.4210887Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:10.4211202Z #define __cpp_exceptions 199711L 2025-05-07T19:45:10.4211516Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:10.4211822Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:10.4212160Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:10.4212480Z #define __cpp_hex_float 201603L 2025-05-07T19:45:10.4212789Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:10.4213123Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:10.4213460Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:10.4213796Z #define __cpp_init_captures 201304L 2025-05-07T19:45:10.4214079Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:10.4214392Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:10.4214689Z #define __cpp_lambdas 200907L 2025-05-07T19:45:10.4215014Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:10.4215349Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:10.4215716Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:10.4216195Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:10.4216718Z #define __cpp_nontype_template_args 201411L 
2025-05-07T19:45:10.4217136Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:10.4217542Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:10.4217860Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:10.4218277Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:10.4218615Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:10.4218944Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:10.4219303Z #define __cpp_rtti 199711L 2025-05-07T19:45:10.4219599Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:10.4219953Z #define __cpp_static_assert 201411L 2025-05-07T19:45:10.4220299Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:10.4220729Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:10.4221095Z #define __cpp_template_auto 201606L 2025-05-07T19:45:10.4221429Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:10.4221801Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:10.4222134Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:10.4222492Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:10.4222836Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:10.4223191Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:10.4223545Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:10.4223849Z #define __gnu_linux__ 1 2025-05-07T19:45:10.4224126Z #define __k8 1 2025-05-07T19:45:10.4224350Z #define __k8__ 1 2025-05-07T19:45:10.4224611Z #define __linux 1 2025-05-07T19:45:10.4224842Z #define __linux__ 1 2025-05-07T19:45:10.4225096Z #define __llvm__ 1 2025-05-07T19:45:10.4225333Z #define __pic__ 2 2025-05-07T19:45:10.4225584Z #define __pie__ 2 2025-05-07T19:45:10.4225835Z #define __private_extern__ extern 2025-05-07T19:45:10.4226203Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:10.4226630Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:10.4226983Z #define __tune_k8__ 1 2025-05-07T19:45:10.4227270Z #define 
__unix 1 2025-05-07T19:45:10.4227512Z #define __unix__ 1 2025-05-07T19:45:10.4227776Z #define __x86_64 1 2025-05-07T19:45:10.4228014Z #define __x86_64__ 1 2025-05-07T19:45:10.4228270Z #define linux 1 2025-05-07T19:45:10.4228490Z #define unix 1 2025-05-07T19:45:10.4228638Z 2025-05-07T19:45:10.4659999Z 2025-05-07T19:45:10.4660555Z + conda run -n build_binary c++ --version 2025-05-07T19:45:10.4661114Z 2025-05-07T19:45:12.0881255Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:12.0883073Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:12.0883825Z Thread model: posix 2025-05-07T19:45:12.0884733Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:12.0886587Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:12.0887907Z 2025-05-07T19:45:12.1462738Z 2025-05-07T19:45:12.1465189Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:12.1467397Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:12.1468078Z 2025-05-07T19:45:13.8640549Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:13.8643385Z 2025-05-07T19:45:13.8644030Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:13.8644724Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:13.8645061Z 2025-05-07T19:45:15.5938398Z #define __cplusplus 201703L 2025-05-07T19:45:15.5943736Z 2025-05-07T19:45:15.5944186Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:15.6041752Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.6042265Z . 
$PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.6043152Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:15.6043564Z env: 2025-05-07T19:45:15.6043812Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:15.6044198Z BUILD_ENV: build_binary 2025-05-07T19:45:15.6044470Z BUILD_TARGET: default 2025-05-07T19:45:15.6044757Z BUILD_VARIANT: cuda 2025-05-07T19:45:15.6045037Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:15.6045302Z ##[endgroup] 2025-05-07T19:45:16.0177034Z ################################################################################ 2025-05-07T19:45:16.0177402Z # Install Build Tools 2025-05-07T19:45:16.0177648Z # 2025-05-07T19:45:16.0195462Z # [2025-05-07T19:45:16.018Z] + install_build_tools build_binary 2025-05-07T19:45:16.0196894Z ################################################################################ 2025-05-07T19:45:16.0197845Z 2025-05-07T19:45:16.0213219Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:16.1102512Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:16.1117988Z [INSTALL] Installing build tools ... 
2025-05-07T19:45:16.1140527Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:16.8342268Z Channels: 2025-05-07T19:45:16.8342943Z - conda-forge 2025-05-07T19:45:16.8343583Z Platform: linux-64 2025-05-07T19:45:19.9759381Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:23.6210430Z Solving environment: \ | / - done 2025-05-07T19:45:23.6841307Z 2025-05-07T19:45:23.6841603Z ## Package Plan ## 2025-05-07T19:45:23.6842138Z 2025-05-07T19:45:23.6842974Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:23.6843441Z 2025-05-07T19:45:23.6843639Z added / updated specs: 2025-05-07T19:45:23.6844096Z - auditwheel 2025-05-07T19:45:23.6844527Z - bazel 2025-05-07T19:45:23.6844848Z - cmake[version='>=3.30'] 2025-05-07T19:45:23.6845223Z - hypothesis 2025-05-07T19:45:23.6845451Z - jinja2 2025-05-07T19:45:23.6845647Z - make 2025-05-07T19:45:23.6845855Z - ncurses 2025-05-07T19:45:23.6846051Z - ninja 2025-05-07T19:45:23.6846264Z - openblas 2025-05-07T19:45:23.6846471Z - patchelf 2025-05-07T19:45:23.6846703Z - pyyaml 2025-05-07T19:45:23.6846919Z - rhash 2025-05-07T19:45:23.6847119Z - scikit-build 2025-05-07T19:45:23.6847368Z - wheel 2025-05-07T19:45:23.6847480Z 2025-05-07T19:45:23.6847485Z 2025-05-07T19:45:23.6847608Z The following packages will be downloaded: 2025-05-07T19:45:23.6847850Z 2025-05-07T19:45:23.6847971Z package | build 2025-05-07T19:45:23.6848396Z ---------------------------|----------------- 2025-05-07T19:45:23.6848783Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:23.6849233Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:23.6849682Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:23.6850103Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:23.6850519Z c-ares-1.34.5 | hb9d3cd8_0 
202 KB conda-forge 2025-05-07T19:45:23.6850917Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:23.6851342Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:23.6851755Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:23.6852159Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:23.6853075Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:23.6853506Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:23.6853971Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:23.6854472Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:23.6854987Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:23.6855484Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:23.6855910Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:23.6856846Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.6857430Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.6857904Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:23.6858352Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:23.6858781Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:23.6859234Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:23.6859684Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:23.6860135Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:23.6860548Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:23.6860988Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:23.6861423Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:23.6861817Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:23.6862235Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:23.6862687Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:23.6863200Z 
libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:23.6863642Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:23.6864123Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:23.6864639Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:23.6865106Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:23.6865570Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:23.6866033Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:23.6866543Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:23.6867032Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:23.6867549Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:23.6868043Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:23.6868494Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:23.6869000Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:23.6869470Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:23.6869967Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:23.6870679Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:23.6871212Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:23.6871727Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:23.6872382Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:23.6872883Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:23.6873352Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:23.6873838Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:23.6874308Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:23.6874749Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:23.6875308Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:23.6875760Z libwebp-base-1.5.0 | h851e524_0 420 KB 
conda-forge 2025-05-07T19:45:23.6876223Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:23.6876659Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:23.6877108Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:23.6877587Z markupsafe-3.0.2 | py313h8060acc_1 24 KB conda-forge 2025-05-07T19:45:23.6878021Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:23.6878465Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:23.6878920Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:23.6879410Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:23.6879858Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:23.6880344Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:23.6880809Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:23.6881238Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:23.6881728Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:23.6882326Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:23.6882923Z python-3.13.2 |hf636f53_101_cp313 31.7 MB conda-forge 2025-05-07T19:45:23.6883374Z pyyaml-6.0.2 | py313h8060acc_2 201 KB conda-forge 2025-05-07T19:45:23.6883783Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:23.6884204Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:23.6884626Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:23.6885093Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:23.6885549Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:23.6886019Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:23.6886445Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:23.6886838Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:23.6887250Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:23.6887659Z xorg-libice-1.1.2 | 
hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:23.6888115Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:23.6888579Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:23.6889013Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:23.6889477Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.6890187Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:23.6890732Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.6891196Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:23.6891695Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:23.6892211Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:23.6892681Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:23.6893238Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:23.6893666Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:23.6894111Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:23.6894555Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:23.6894995Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:23.6895422Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:23.6895824Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:23.6896351Z ------------------------------------------------------------ 2025-05-07T19:45:23.6896901Z Total: 339.1 MB 2025-05-07T19:45:23.6897158Z 2025-05-07T19:45:23.6897300Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:23.6897572Z 2025-05-07T19:45:23.6897815Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:23.6898284Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:23.6898788Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:23.6899262Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:23.6899725Z c-ares 
conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:23.6900195Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:23.6900633Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:23.6901087Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:23.6901527Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:23.6902071Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:23.6902698Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:23.6903365Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:23.6904031Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:23.6904639Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:23.6905204Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:23.6905737Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:23.6906288Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:23.6906817Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.6907291Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:23.6907795Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:23.6908296Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:23.6908930Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:23.6909378Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:23.6909766Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:23.6910294Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:23.6910701Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:23.6911112Z lcms2 
conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:23.6911514Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:23.6911961Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:23.6912452Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:23.6912874Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:23.6913407Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:23.6913898Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:23.6914342Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:23.6914775Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:23.6915230Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.6915737Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:23.6916236Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:23.6916714Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:23.6917193Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:23.6917618Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:23.6918101Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:23.6918586Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6919046Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6919550Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:23.6920052Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:23.6920556Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:23.6921040Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:23.6921526Z libre2-11 
conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:23.6922002Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:23.6922439Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:23.6922887Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:23.6923482Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:23.6924011Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:23.6924489Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:23.6924898Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:23.6925372Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py313h8060acc_1 2025-05-07T19:45:23.6925833Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:23.6926314Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:23.6926811Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:23.6927264Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:23.6927751Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:23.6928185Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:23.6928618Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:23.6929192Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:23.6929696Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:23.6930175Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py313h8060acc_2 2025-05-07T19:45:23.6930601Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:23.6931020Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:23.6931514Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:23.6932015Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:23.6932826Z 
sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:23.6933432Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:23.6933938Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:23.6934477Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:23.6934983Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:23.6935523Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:23.6936145Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:23.6936737Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:23.6937334Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:23.6937876Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:23.6938443Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:23.6939024Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:23.6939606Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:23.6940155Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:23.6940680Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.6941199Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.6941640Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:23.6941929Z 2025-05-07T19:45:23.6942058Z The following packages will be UPDATED: 2025-05-07T19:45:23.6942281Z 2025-05-07T19:45:23.6942595Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:23.6943149Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.6943723Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:23.6944401Z 
python pkgs/main::python-3.13.2-hf623796_100~ --> conda-forge::python-3.13.2-hf636f53_101_cp313 2025-05-07T19:45:23.6945108Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:23.6945810Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:23.6946438Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.6946938Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.6947347Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:23.6947631Z 2025-05-07T19:45:23.6947861Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:23.6948200Z 2025-05-07T19:45:23.6948489Z expat pkgs/main::expat-2.7.1-h6a678d5_0 --> conda-forge::expat-2.7.0-h5888daf_0 2025-05-07T19:45:23.6949092Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:23.6949458Z 2025-05-07T19:45:23.6949593Z 2025-05-07T19:45:23.6949598Z 2025-05-07T19:45:23.6949758Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:23.6950191Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:23.6950750Z bazel-7.5.0 | 47.4 MB | | 0% 2025-05-07T19:45:23.6951249Z python-3.13.2 | 31.7 MB | | 0% 2025-05-07T19:45:23.6972015Z cmake-4.0.2 | 19.4 MB | | 0% 2025-05-07T19:45:23.6979041Z libgrpc-1.71.0 | 7.6 MB | | 0% 2025-05-07T19:45:23.6980425Z openblas-0.3.29 | 5.8 MB | | 0% 2025-05-07T19:45:23.6981024Z libopenblas-0.3.29 | 5.6 MB | | 0% 2025-05-07T19:45:23.6981708Z libcups-2.3.3 | 4.3 MB | | 0% 2025-05-07T19:45:23.6982672Z libglib-2.84.0 | 3.8 MB | | 0% 2025-05-07T19:45:23.6983771Z libprotobuf-5.29.3 | 3.2 MB | | 0%
2025-05-07T19:45:23.6984831Z tk-8.6.13 | 3.2 MB | | 0% 2025-05-07T19:45:23.6988744Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0% 2025-05-07T19:45:23.6990794Z harfbuzz-9.0.0 | 1.5 MB | | 0% 2025-05-07T19:45:23.6993045Z libgfortran5-15.1.0 | 1.5 MB | | 0% 2025-05-07T19:45:23.6995044Z krb5-1.21.3 | 1.3 MB | | 0%
2025-05-07T19:45:23.6997106Z libabseil-20250127.1 | 1.3 MB | | 0% 2025-05-07T19:45:23.6997806Z cairo-1.18.0 | 961 KB | | 0% 2025-05-07T19:45:23.6998484Z pcre2-10.44 | 934 KB | | 0%
2025-05-07T19:45:23.6999226Z libsqlite-3.49.2 | 895 KB | | 0% 2025-05-07T19:45:23.8035435Z ... (more hidden) ... 2025-05-07T19:45:24.0026409Z libgrpc-1.71.0 | 7.6 MB | 2 | 2% 2025-05-07T19:45:24.0586475Z libgrpc-1.71.0 | 7.6 MB | 4 | 5% 2025-05-07T19:45:24.0587523Z cmake-4.0.2 | 19.4 MB | | 0% 2025-05-07T19:45:24.0848351Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:24.0922816Z python-3.13.2 | 31.7 MB | | 0% 2025-05-07T19:45:24.1028140Z bazel-7.5.0 | 47.4 MB | | 0% 2025-05-07T19:45:24.1585108Z libgrpc-1.71.0 | 7.6 MB | #########2 | 93% 2025-05-07T19:45:24.1588513Z cmake-4.0.2 | 19.4 MB | #####1 | 51% 2025-05-07T19:45:24.1848823Z openjdk-23.0.1 | 181.3 MB | 4 | 4%
2025-05-07T19:45:24.1925257Z python-3.13.2 | 31.7 MB | ##3 | 24% 2025-05-07T19:45:24.2035476Z bazel-7.5.0 | 47.4 MB | #1 | 11% 2025-05-07T19:45:24.2591245Z libgrpc-1.71.0 | 7.6 MB | ########## | 100% 2025-05-07T19:45:24.2592748Z cmake-4.0.2 | 19.4 MB | ######### | 90% 2025-05-07T19:45:24.2665440Z openjdk-23.0.1 | 181.3 MB | 8 | 9% 2025-05-07T19:45:24.2849847Z openblas-0.3.29 | 5.8 MB | | 0% 2025-05-07T19:45:24.2924171Z python-3.13.2 | 31.7 MB | ####4 | 45% 2025-05-07T19:45:24.3672017Z bazel-7.5.0 | 47.4 MB | ##4 | 24% 2025-05-07T19:45:24.3859272Z openblas-0.3.29 | 5.8 MB | ######8 | 69% 2025-05-07T19:45:24.3969866Z openjdk-23.0.1 | 181.3 MB | #2 | 13% 2025-05-07T19:45:24.4017965Z bazel-7.5.0 | 47.4 MB | ###3 | 34% 2025-05-07T19:45:24.4865102Z python-3.13.2 | 31.7 MB | ######2 | 62% 2025-05-07T19:45:24.4969828Z openjdk-23.0.1 | 181.3 MB | #6 | 16% 2025-05-07T19:45:24.5017381Z bazel-7.5.0 | 47.4 MB | ####6 | 46% 2025-05-07T19:45:24.5062487Z python-3.13.2 | 31.7 MB | ########4 | 84%
2025-05-07T19:45:24.5583663Z openblas-0.3.29 | 5.8 MB | ########## | 100% 2025-05-07T19:45:24.5862834Z libopenblas-0.3.29 | 5.6 MB | | 0% 2025-05-07T19:45:24.6126545Z openjdk-23.0.1 | 181.3 MB | ## | 21% 2025-05-07T19:45:24.6494236Z cmake-4.0.2 | 19.4 MB | ########## | 100% 2025-05-07T19:45:24.6567851Z libcups-2.3.3 | 4.3 MB | | 0% 2025-05-07T19:45:24.6584312Z bazel-7.5.0 | 47.4 MB | #####6 | 57% 2025-05-07T19:45:24.6934258Z libopenblas-0.3.29 | 5.6 MB | ########8 | 88% 2025-05-07T19:45:24.7567245Z openjdk-23.0.1 | 181.3 MB | ##4 | 25% 2025-05-07T19:45:24.7951106Z bazel-7.5.0 | 47.4 MB | ######7 | 67% 2025-05-07T19:45:24.7954550Z openjdk-23.0.1 | 181.3 MB | ##8 | 28% 2025-05-07T19:45:24.8139102Z libopenblas-0.3.29 | 5.6 MB | ########## | 100% 2025-05-07T19:45:24.8141270Z libcups-2.3.3 | 4.3 MB | ########## | 100%
2025-05-07T19:45:24.8141580Z 2025-05-07T19:45:24.8141583Z 2025-05-07T19:45:24.8142236Z 2025-05-07T19:45:24.8432384Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.8432692Z 2025-05-07T19:45:24.8432696Z 2025-05-07T19:45:24.8432700Z 2025-05-07T19:45:24.8432703Z 2025-05-07T19:45:24.8432722Z 2025-05-07T19:45:24.8432725Z 2025-05-07T19:45:24.8432729Z 2025-05-07T19:45:24.8432732Z 2025-05-07T19:45:24.8570533Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:24.8570833Z 2025-05-07T19:45:24.8938663Z bazel-7.5.0 | 47.4 MB | #######7 | 77%  2025-05-07T19:45:24.8938942Z 2025-05-07T19:45:24.8938947Z 2025-05-07T19:45:24.8938950Z 2025-05-07T19:45:24.8938954Z 2025-05-07T19:45:24.8938973Z 2025-05-07T19:45:24.8938977Z 2025-05-07T19:45:24.8938996Z 2025-05-07T19:45:24.8938999Z 2025-05-07T19:45:24.8939003Z 2025-05-07T19:45:24.9177638Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:24.9571923Z openjdk-23.0.1 | 181.3 MB | ###2 | 32% 2025-05-07T19:45:24.9572236Z 2025-05-07T19:45:24.9983618Z bazel-7.5.0 | 47.4 MB | ########8 | 88%  2025-05-07T19:45:24.9983901Z 2025-05-07T19:45:24.9983906Z 2025-05-07T19:45:24.9983909Z 2025-05-07T19:45:24.9983913Z 2025-05-07T19:45:24.9983916Z 2025-05-07T19:45:24.9983920Z 2025-05-07T19:45:24.9983923Z 2025-05-07T19:45:24.9983941Z 2025-05-07T19:45:24.9984213Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:24.9984493Z 2025-05-07T19:45:24.9984496Z 2025-05-07T19:45:24.9984500Z 2025-05-07T19:45:24.9984503Z 2025-05-07T19:45:24.9984507Z 2025-05-07T19:45:24.9984510Z 2025-05-07T19:45:24.9984513Z 2025-05-07T19:45:24.9984526Z 2025-05-07T19:45:25.0163757Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:25.0165401Z 2025-05-07T19:45:25.0165425Z 2025-05-07T19:45:25.0165446Z 2025-05-07T19:45:25.0165469Z 2025-05-07T19:45:25.0165486Z 2025-05-07T19:45:25.0165507Z 2025-05-07T19:45:25.0165530Z 2025-05-07T19:45:25.0165553Z 2025-05-07T19:45:25.0165597Z 2025-05-07T19:45:25.0167208Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  
2025-05-07T19:45:25.0167510Z 2025-05-07T19:45:25.0167514Z 2025-05-07T19:45:25.0167517Z 2025-05-07T19:45:25.0167521Z 2025-05-07T19:45:25.0167524Z 2025-05-07T19:45:25.0167527Z 2025-05-07T19:45:25.0167531Z 2025-05-07T19:45:25.0167535Z 2025-05-07T19:45:25.0167539Z 2025-05-07T19:45:25.0177862Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.0405870Z openjdk-23.0.1 | 181.3 MB | ###5 | 36% 2025-05-07T19:45:25.0406393Z 2025-05-07T19:45:25.0406400Z 2025-05-07T19:45:25.0406688Z 2025-05-07T19:45:25.0406692Z 2025-05-07T19:45:25.0406696Z 2025-05-07T19:45:25.0406699Z 2025-05-07T19:45:25.0406703Z 2025-05-07T19:45:25.0406706Z 2025-05-07T19:45:25.0406710Z 2025-05-07T19:45:25.0406713Z 2025-05-07T19:45:25.0518794Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:25.0519082Z 2025-05-07T19:45:25.0519344Z 2025-05-07T19:45:25.0519353Z 2025-05-07T19:45:25.0519358Z 2025-05-07T19:45:25.0519392Z 2025-05-07T19:45:25.0519397Z 2025-05-07T19:45:25.0519402Z 2025-05-07T19:45:25.0519406Z 2025-05-07T19:45:25.0519411Z 2025-05-07T19:45:25.0519415Z 2025-05-07T19:45:25.1014683Z 2025-05-07T19:45:25.1015224Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:25.1015578Z 2025-05-07T19:45:25.1015582Z 2025-05-07T19:45:25.1015586Z 2025-05-07T19:45:25.1015590Z 2025-05-07T19:45:25.1015593Z 2025-05-07T19:45:25.1015597Z 2025-05-07T19:45:25.1015600Z 2025-05-07T19:45:25.1015618Z 2025-05-07T19:45:25.1015622Z 2025-05-07T19:45:25.1015625Z 2025-05-07T19:45:25.1015629Z 2025-05-07T19:45:25.1180306Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.1340825Z openjdk-23.0.1 | 181.3 MB | ###9 | 40% 2025-05-07T19:45:25.1341116Z 2025-05-07T19:45:25.1341120Z 2025-05-07T19:45:25.1341124Z 2025-05-07T19:45:25.1341413Z 2025-05-07T19:45:25.1374423Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:25.1374722Z 2025-05-07T19:45:25.1374727Z 2025-05-07T19:45:25.1374731Z 2025-05-07T19:45:25.1374734Z 2025-05-07T19:45:25.1374738Z 2025-05-07T19:45:25.1374741Z 
2025-05-07T19:45:25.1374745Z 2025-05-07T19:45:25.1374749Z 2025-05-07T19:45:25.1374752Z 2025-05-07T19:45:25.1375202Z 2025-05-07T19:45:25.1410498Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.1411024Z 2025-05-07T19:45:25.1411071Z 2025-05-07T19:45:25.1419943Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:25.1420248Z 2025-05-07T19:45:25.1420252Z 2025-05-07T19:45:25.1420256Z 2025-05-07T19:45:25.1420259Z 2025-05-07T19:45:25.1420263Z 2025-05-07T19:45:25.1420266Z 2025-05-07T19:45:25.1420270Z 2025-05-07T19:45:25.1420274Z 2025-05-07T19:45:25.1420277Z 2025-05-07T19:45:25.1420281Z 2025-05-07T19:45:25.1420284Z 2025-05-07T19:45:25.1430540Z 2025-05-07T19:45:25.1697828Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:25.1698401Z 2025-05-07T19:45:25.1698406Z 2025-05-07T19:45:25.1698410Z 2025-05-07T19:45:25.1698413Z 2025-05-07T19:45:25.1698416Z 2025-05-07T19:45:25.1698420Z 2025-05-07T19:45:25.1698423Z 2025-05-07T19:45:25.1698427Z 2025-05-07T19:45:25.1698431Z 2025-05-07T19:45:25.1698434Z 2025-05-07T19:45:25.1698437Z 2025-05-07T19:45:25.1698441Z 2025-05-07T19:45:25.1938745Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.1939157Z 2025-05-07T19:45:25.1939162Z 2025-05-07T19:45:25.1939165Z 2025-05-07T19:45:25.1939169Z 2025-05-07T19:45:25.1939172Z 2025-05-07T19:45:25.1939176Z 2025-05-07T19:45:25.1939179Z 2025-05-07T19:45:25.1939183Z 2025-05-07T19:45:25.1939186Z 2025-05-07T19:45:25.1939189Z 2025-05-07T19:45:25.1939205Z 2025-05-07T19:45:25.1939209Z 2025-05-07T19:45:25.1939212Z 2025-05-07T19:45:25.2126302Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:25.2126672Z 2025-05-07T19:45:25.2126676Z 2025-05-07T19:45:25.2126680Z 2025-05-07T19:45:25.2126683Z 2025-05-07T19:45:25.2126687Z 2025-05-07T19:45:25.2126703Z 2025-05-07T19:45:25.2126707Z 2025-05-07T19:45:25.2126710Z 2025-05-07T19:45:25.2126714Z 2025-05-07T19:45:25.2126717Z 2025-05-07T19:45:25.2126720Z 2025-05-07T19:45:25.2126724Z 2025-05-07T19:45:25.2126727Z 
2025-05-07T19:45:25.2126730Z 2025-05-07T19:45:25.2126734Z 2025-05-07T19:45:25.2206602Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:25.2207421Z 2025-05-07T19:45:25.2207430Z 2025-05-07T19:45:25.2207437Z 2025-05-07T19:45:25.2207445Z 2025-05-07T19:45:25.2207451Z 2025-05-07T19:45:25.2207458Z 2025-05-07T19:45:25.2207466Z 2025-05-07T19:45:25.2207472Z 2025-05-07T19:45:25.2207478Z 2025-05-07T19:45:25.2207486Z 2025-05-07T19:45:25.2207493Z 2025-05-07T19:45:25.2207508Z 2025-05-07T19:45:25.2207515Z 2025-05-07T19:45:25.2207523Z 2025-05-07T19:45:25.2304880Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:25.2306146Z 2025-05-07T19:45:25.2306158Z 2025-05-07T19:45:25.2306170Z 2025-05-07T19:45:25.2306180Z 2025-05-07T19:45:25.2306191Z 2025-05-07T19:45:25.2306201Z 2025-05-07T19:45:25.2306211Z 2025-05-07T19:45:25.2306222Z 2025-05-07T19:45:25.2306232Z 2025-05-07T19:45:25.2306242Z 2025-05-07T19:45:25.2306253Z 2025-05-07T19:45:25.2306263Z 2025-05-07T19:45:25.2306287Z 2025-05-07T19:45:25.2436225Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.2481842Z openjdk-23.0.1 | 181.3 MB | ####3 | 43% 2025-05-07T19:45:25.2483335Z 2025-05-07T19:45:25.2483360Z 2025-05-07T19:45:25.2483376Z 2025-05-07T19:45:25.2483397Z 2025-05-07T19:45:25.2483419Z 2025-05-07T19:45:25.2483437Z 2025-05-07T19:45:25.2483457Z 2025-05-07T19:45:25.2483520Z 2025-05-07T19:45:25.2483541Z 2025-05-07T19:45:25.2483562Z 2025-05-07T19:45:25.2483585Z 2025-05-07T19:45:25.2483599Z 2025-05-07T19:45:25.2483616Z 2025-05-07T19:45:25.2483631Z 2025-05-07T19:45:25.2483647Z 2025-05-07T19:45:25.2663497Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:25.2664163Z 2025-05-07T19:45:25.2664172Z 2025-05-07T19:45:25.2664177Z 2025-05-07T19:45:25.2664184Z 2025-05-07T19:45:25.2664191Z 2025-05-07T19:45:25.2664197Z 2025-05-07T19:45:25.2664204Z 2025-05-07T19:45:25.2664212Z 2025-05-07T19:45:25.2664217Z 2025-05-07T19:45:25.2664242Z 2025-05-07T19:45:25.2664269Z 2025-05-07T19:45:25.2664274Z 
2025-05-07T19:45:25.2664281Z 2025-05-07T19:45:25.2664289Z 2025-05-07T19:45:25.2839009Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:25.2839437Z 2025-05-07T19:45:25.2839442Z 2025-05-07T19:45:25.2839460Z 2025-05-07T19:45:25.2839463Z 2025-05-07T19:45:25.2839481Z 2025-05-07T19:45:25.2839484Z 2025-05-07T19:45:25.2839488Z 2025-05-07T19:45:25.2839491Z 2025-05-07T19:45:25.2839495Z 2025-05-07T19:45:25.2839498Z 2025-05-07T19:45:25.2839502Z 2025-05-07T19:45:25.2839505Z 2025-05-07T19:45:25.2839509Z 2025-05-07T19:45:25.2839512Z 2025-05-07T19:45:25.2839516Z 2025-05-07T19:45:25.2839519Z 2025-05-07T19:45:25.2839523Z 2025-05-07T19:45:25.2847231Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:25.2847675Z 2025-05-07T19:45:25.2847679Z 2025-05-07T19:45:25.2847683Z 2025-05-07T19:45:25.2847694Z 2025-05-07T19:45:25.2848084Z 2025-05-07T19:45:25.2980326Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:25.2980809Z 2025-05-07T19:45:25.2980814Z 2025-05-07T19:45:25.2980817Z 2025-05-07T19:45:25.2980821Z 2025-05-07T19:45:25.2980825Z 2025-05-07T19:45:25.2980828Z 2025-05-07T19:45:25.2980832Z 2025-05-07T19:45:25.2980835Z 2025-05-07T19:45:25.2981012Z 2025-05-07T19:45:25.2981016Z 2025-05-07T19:45:25.2981019Z 2025-05-07T19:45:25.2981023Z 2025-05-07T19:45:25.2981026Z 2025-05-07T19:45:25.2981029Z 2025-05-07T19:45:25.2981033Z 2025-05-07T19:45:25.2981036Z 2025-05-07T19:45:25.3102698Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:25.3103142Z 2025-05-07T19:45:25.3103146Z 2025-05-07T19:45:25.3103150Z 2025-05-07T19:45:25.3103154Z 2025-05-07T19:45:25.3103157Z 2025-05-07T19:45:25.3103161Z 2025-05-07T19:45:25.3103164Z 2025-05-07T19:45:25.3103168Z 2025-05-07T19:45:25.3103171Z 2025-05-07T19:45:25.3103977Z 2025-05-07T19:45:25.3103996Z 2025-05-07T19:45:25.3103999Z 2025-05-07T19:45:25.3104003Z 2025-05-07T19:45:25.3104006Z 2025-05-07T19:45:25.3104009Z 2025-05-07T19:45:25.3104013Z 2025-05-07T19:45:25.3104016Z 2025-05-07T19:45:25.3104020Z 2025-05-07T19:45:25.3181318Z libsqlite-3.49.2 | 
895 KB | 1 | 2%  2025-05-07T19:45:25.3181846Z 2025-05-07T19:45:25.3181851Z 2025-05-07T19:45:25.3181855Z 2025-05-07T19:45:25.3181858Z 2025-05-07T19:45:25.3181862Z 2025-05-07T19:45:25.3181865Z 2025-05-07T19:45:25.3181869Z 2025-05-07T19:45:25.3181872Z 2025-05-07T19:45:25.3181875Z 2025-05-07T19:45:25.3181879Z 2025-05-07T19:45:25.3181882Z 2025-05-07T19:45:25.3181886Z 2025-05-07T19:45:25.3181890Z 2025-05-07T19:45:25.3181893Z 2025-05-07T19:45:25.3181896Z 2025-05-07T19:45:25.3181900Z 2025-05-07T19:45:25.3181903Z 2025-05-07T19:45:25.3331603Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:25.3332076Z 2025-05-07T19:45:25.3332080Z 2025-05-07T19:45:25.3332084Z 2025-05-07T19:45:25.3332087Z 2025-05-07T19:45:25.3332091Z 2025-05-07T19:45:25.3332094Z 2025-05-07T19:45:25.3332098Z 2025-05-07T19:45:25.3332101Z 2025-05-07T19:45:25.3332105Z 2025-05-07T19:45:25.3332108Z 2025-05-07T19:45:25.3332112Z 2025-05-07T19:45:25.3332115Z 2025-05-07T19:45:25.3332139Z 2025-05-07T19:45:25.3332143Z 2025-05-07T19:45:25.3332146Z 2025-05-07T19:45:25.3332150Z 2025-05-07T19:45:25.3437364Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:25.3445176Z openjdk-23.0.1 | 181.3 MB | ####7 | 47% 2025-05-07T19:45:25.3445455Z 2025-05-07T19:45:25.3445460Z 2025-05-07T19:45:25.3445465Z 2025-05-07T19:45:25.3445469Z 2025-05-07T19:45:25.3445474Z 2025-05-07T19:45:25.3445479Z 2025-05-07T19:45:25.3445483Z 2025-05-07T19:45:25.3445488Z 2025-05-07T19:45:25.3445493Z 2025-05-07T19:45:25.3445498Z 2025-05-07T19:45:25.3445520Z 2025-05-07T19:45:25.3445525Z 2025-05-07T19:45:25.3445529Z 2025-05-07T19:45:25.3445547Z 2025-05-07T19:45:25.3445550Z 2025-05-07T19:45:25.3445554Z 2025-05-07T19:45:25.3445557Z 2025-05-07T19:45:25.3446702Z 2025-05-07T19:45:25.3532417Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:25.3533494Z 2025-05-07T19:45:25.3533509Z 2025-05-07T19:45:25.3533521Z 2025-05-07T19:45:25.3533532Z 2025-05-07T19:45:25.3533543Z 2025-05-07T19:45:25.3533553Z 2025-05-07T19:45:25.3533563Z 
2025-05-07T19:45:25.3533574Z 2025-05-07T19:45:25.3533584Z 2025-05-07T19:45:25.3533594Z 2025-05-07T19:45:25.3533604Z 2025-05-07T19:45:25.3533614Z 2025-05-07T19:45:25.3533624Z 2025-05-07T19:45:25.3533635Z 2025-05-07T19:45:25.3533645Z 2025-05-07T19:45:25.3533655Z 2025-05-07T19:45:25.3533666Z 2025-05-07T19:45:25.3533677Z 2025-05-07T19:45:25.3533687Z 2025-05-07T19:45:25.3743077Z ... (more hidden) ... 2025-05-07T19:45:25.3743419Z 2025-05-07T19:45:25.3743424Z 2025-05-07T19:45:25.3743427Z 2025-05-07T19:45:25.3743431Z 2025-05-07T19:45:25.3743434Z 2025-05-07T19:45:25.3743438Z 2025-05-07T19:45:25.3743441Z 2025-05-07T19:45:25.3743445Z 2025-05-07T19:45:25.3743448Z 2025-05-07T19:45:25.3743451Z 2025-05-07T19:45:25.3743455Z 2025-05-07T19:45:25.3743458Z 2025-05-07T19:45:25.3743656Z 2025-05-07T19:45:25.3743661Z 2025-05-07T19:45:25.3743665Z 2025-05-07T19:45:25.3743668Z 2025-05-07T19:45:25.3743671Z 2025-05-07T19:45:25.3743675Z 2025-05-07T19:45:25.3743678Z 2025-05-07T19:45:25.4681656Z ... (more hidden) ... 2025-05-07T19:45:25.4798325Z openjdk-23.0.1 | 181.3 MB | ##### | 51% 2025-05-07T19:45:25.4798693Z 2025-05-07T19:45:25.4798781Z 2025-05-07T19:45:25.4798784Z 2025-05-07T19:45:25.4798805Z 2025-05-07T19:45:25.4798808Z 2025-05-07T19:45:25.4798812Z 2025-05-07T19:45:25.4968284Z 2025-05-07T19:45:25.4968720Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:25.4969226Z 2025-05-07T19:45:25.4969231Z 2025-05-07T19:45:25.4969234Z 2025-05-07T19:45:25.4969237Z 2025-05-07T19:45:25.4969241Z 2025-05-07T19:45:25.4969244Z 2025-05-07T19:45:25.5683001Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:25.6683842Z openjdk-23.0.1 | 181.3 MB | #####4 | 55% 2025-05-07T19:45:25.7356344Z openjdk-23.0.1 | 181.3 MB | #####8 | 58% 2025-05-07T19:45:25.7356711Z 2025-05-07T19:45:25.7357052Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:25.7357290Z 2025-05-07T19:45:25.8458891Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:25.8916970Z openjdk-23.0.1 | 181.3 MB | 
######1 | 62% 2025-05-07T19:45:25.8917756Z 2025-05-07T19:45:25.8917790Z 2025-05-07T19:45:25.8917802Z 2025-05-07T19:45:25.8917812Z 2025-05-07T19:45:25.8917822Z 2025-05-07T19:45:25.8917832Z 2025-05-07T19:45:25.8917873Z 2025-05-07T19:45:25.8917883Z 2025-05-07T19:45:25.9467525Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:25.9644128Z openjdk-23.0.1 | 181.3 MB | ######6 | 67% 2025-05-07T19:45:25.9644945Z 2025-05-07T19:45:25.9644959Z 2025-05-07T19:45:25.9644969Z 2025-05-07T19:45:25.9644980Z 2025-05-07T19:45:25.9644991Z 2025-05-07T19:45:25.9645033Z 2025-05-07T19:45:25.9645044Z 2025-05-07T19:45:25.9645054Z 2025-05-07T19:45:25.9645064Z 2025-05-07T19:45:25.9645074Z 2025-05-07T19:45:25.9645084Z 2025-05-07T19:45:25.9646059Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.9646989Z 2025-05-07T19:45:25.9647000Z 2025-05-07T19:45:25.9647010Z 2025-05-07T19:45:25.9647021Z 2025-05-07T19:45:25.9647031Z 2025-05-07T19:45:25.9647042Z 2025-05-07T19:45:25.9647052Z 2025-05-07T19:45:25.9647062Z 2025-05-07T19:45:25.9647073Z 2025-05-07T19:45:25.9647083Z 2025-05-07T19:45:25.9647102Z 2025-05-07T19:45:26.0230195Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.0230553Z 2025-05-07T19:45:26.0230557Z 2025-05-07T19:45:26.0230561Z 2025-05-07T19:45:26.0230564Z 2025-05-07T19:45:26.0230568Z 2025-05-07T19:45:26.0230572Z 2025-05-07T19:45:26.0230575Z 2025-05-07T19:45:26.0230578Z 2025-05-07T19:45:26.0230582Z 2025-05-07T19:45:26.0468648Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.1471591Z openjdk-23.0.1 | 181.3 MB | #######1 | 71% 2025-05-07T19:45:26.2471987Z openjdk-23.0.1 | 181.3 MB | #######7 | 77% 2025-05-07T19:45:26.3473797Z openjdk-23.0.1 | 181.3 MB | ########2 | 83% 2025-05-07T19:45:26.4637553Z openjdk-23.0.1 | 181.3 MB | ########9 | 89% 2025-05-07T19:45:26.4638126Z 2025-05-07T19:45:26.4638137Z 2025-05-07T19:45:26.4638142Z 2025-05-07T19:45:26.4638147Z 2025-05-07T19:45:26.4638151Z 2025-05-07T19:45:26.4638156Z 
2025-05-07T19:45:26.4638184Z 2025-05-07T19:45:26.4638189Z 2025-05-07T19:45:26.4638193Z 2025-05-07T19:45:26.4638198Z 2025-05-07T19:45:26.4638813Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.4639121Z 2025-05-07T19:45:26.4639125Z 2025-05-07T19:45:26.4639129Z 2025-05-07T19:45:26.4639132Z 2025-05-07T19:45:26.4639136Z 2025-05-07T19:45:26.4639140Z 2025-05-07T19:45:26.4639434Z 2025-05-07T19:45:26.4639440Z 2025-05-07T19:45:26.4639443Z 2025-05-07T19:45:26.4639453Z 2025-05-07T19:45:26.4720837Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.5723249Z openjdk-23.0.1 | 181.3 MB | #########4 | 94% 2025-05-07T19:45:26.6071870Z openjdk-23.0.1 | 181.3 MB | #########9 | 100% 2025-05-07T19:45:26.6072257Z 2025-05-07T19:45:26.6072471Z 2025-05-07T19:45:26.6072486Z 2025-05-07T19:45:26.6072491Z 2025-05-07T19:45:26.6072495Z 2025-05-07T19:45:26.6072501Z 2025-05-07T19:45:26.6072506Z 2025-05-07T19:45:26.6072883Z 2025-05-07T19:45:26.6072887Z 2025-05-07T19:45:26.6072890Z 2025-05-07T19:45:26.6072894Z 2025-05-07T19:45:26.6072897Z 2025-05-07T19:45:26.6073519Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.6073839Z 2025-05-07T19:45:26.6073842Z 2025-05-07T19:45:26.6073846Z 2025-05-07T19:45:26.6073850Z 2025-05-07T19:45:26.6073853Z 2025-05-07T19:45:26.6073866Z 2025-05-07T19:45:26.6073870Z 2025-05-07T19:45:26.6073873Z 2025-05-07T19:45:26.6073877Z 2025-05-07T19:45:26.6073881Z 2025-05-07T19:45:26.6073884Z 2025-05-07T19:45:26.6073887Z 2025-05-07T19:45:26.6860508Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.6860859Z 2025-05-07T19:45:26.6860864Z 2025-05-07T19:45:26.6860868Z 2025-05-07T19:45:26.6860871Z 2025-05-07T19:45:26.6860875Z 2025-05-07T19:45:26.6860878Z 2025-05-07T19:45:26.6860882Z 2025-05-07T19:45:26.6860885Z 2025-05-07T19:45:26.6860905Z 2025-05-07T19:45:26.6860909Z 2025-05-07T19:45:26.6860931Z 2025-05-07T19:45:26.6860935Z 2025-05-07T19:45:26.6860939Z 2025-05-07T19:45:26.6861251Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  
2025-05-07T19:45:26.6861570Z 2025-05-07T19:45:26.6861574Z 2025-05-07T19:45:26.6861577Z 2025-05-07T19:45:26.6861581Z 2025-05-07T19:45:26.6861604Z 2025-05-07T19:45:26.6861608Z 2025-05-07T19:45:26.6861619Z 2025-05-07T19:45:26.6861623Z 2025-05-07T19:45:26.6861626Z 2025-05-07T19:45:26.6861630Z 2025-05-07T19:45:26.6861633Z 2025-05-07T19:45:26.6861637Z 2025-05-07T19:45:26.6861931Z 2025-05-07T19:45:27.1333830Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:27.1334849Z 2025-05-07T19:45:27.1334862Z 2025-05-07T19:45:27.1334873Z 2025-05-07T19:45:27.1334883Z 2025-05-07T19:45:27.1334915Z 2025-05-07T19:45:27.1334925Z 2025-05-07T19:45:27.1334935Z 2025-05-07T19:45:27.1334945Z 2025-05-07T19:45:27.1334956Z 2025-05-07T19:45:27.1334966Z 2025-05-07T19:45:27.1335008Z 2025-05-07T19:45:27.1335019Z 2025-05-07T19:45:27.1335029Z 2025-05-07T19:45:27.1335039Z 2025-05-07T19:45:27.1335049Z 2025-05-07T19:45:27.1335996Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:27.1337207Z 2025-05-07T19:45:27.1337218Z 2025-05-07T19:45:27.1337228Z 2025-05-07T19:45:27.1337254Z 2025-05-07T19:45:27.1337264Z 2025-05-07T19:45:27.1337274Z 2025-05-07T19:45:27.1337284Z 2025-05-07T19:45:27.1337294Z 2025-05-07T19:45:27.1337305Z 2025-05-07T19:45:27.1337315Z 2025-05-07T19:45:27.1337325Z 2025-05-07T19:45:27.1337335Z 2025-05-07T19:45:27.1337345Z 2025-05-07T19:45:27.1337355Z 2025-05-07T19:45:27.1337365Z 2025-05-07T19:45:27.2710327Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:27.2711181Z 2025-05-07T19:45:27.2711186Z 2025-05-07T19:45:27.2711190Z 2025-05-07T19:45:27.2711194Z 2025-05-07T19:45:27.2711198Z 2025-05-07T19:45:27.2711220Z 2025-05-07T19:45:27.2711224Z 2025-05-07T19:45:27.2711227Z 2025-05-07T19:45:27.2711231Z 2025-05-07T19:45:27.2711234Z 2025-05-07T19:45:27.2711238Z 2025-05-07T19:45:27.2711241Z 2025-05-07T19:45:27.2711245Z 2025-05-07T19:45:27.2711266Z 2025-05-07T19:45:27.2711652Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:27.2712167Z 
2025-05-07T19:45:27.2712173Z 2025-05-07T19:45:27.2712177Z 2025-05-07T19:45:27.2712180Z 2025-05-07T19:45:27.2712184Z 2025-05-07T19:45:27.2712187Z 2025-05-07T19:45:27.2712190Z 2025-05-07T19:45:27.2712194Z 2025-05-07T19:45:27.2712197Z 2025-05-07T19:45:27.2712218Z 2025-05-07T19:45:27.2712221Z 2025-05-07T19:45:27.2712224Z 2025-05-07T19:45:27.2712227Z 2025-05-07T19:45:27.2712231Z 2025-05-07T19:45:27.4474696Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:27.4476040Z 2025-05-07T19:45:27.4476056Z 2025-05-07T19:45:27.4476067Z 2025-05-07T19:45:27.4476552Z 2025-05-07T19:45:27.4476562Z 2025-05-07T19:45:27.4476573Z 2025-05-07T19:45:27.4476583Z 2025-05-07T19:45:27.4476593Z 2025-05-07T19:45:27.4476604Z 2025-05-07T19:45:27.4476614Z 2025-05-07T19:45:27.4476624Z 2025-05-07T19:45:27.4476635Z 2025-05-07T19:45:27.4476646Z 2025-05-07T19:45:27.4476656Z 2025-05-07T19:45:27.4476666Z 2025-05-07T19:45:27.4476677Z 2025-05-07T19:45:27.4476703Z 2025-05-07T19:45:27.4477627Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:27.4478546Z 2025-05-07T19:45:27.4478557Z 2025-05-07T19:45:27.4478567Z 2025-05-07T19:45:27.4478577Z 2025-05-07T19:45:27.4478587Z 2025-05-07T19:45:27.4478597Z 2025-05-07T19:45:27.4478608Z 2025-05-07T19:45:27.4478618Z 2025-05-07T19:45:27.4478628Z 2025-05-07T19:45:27.4478639Z 2025-05-07T19:45:27.4478649Z 2025-05-07T19:45:27.4478659Z 2025-05-07T19:45:27.4478670Z 2025-05-07T19:45:27.4478681Z 2025-05-07T19:45:27.4478691Z 2025-05-07T19:45:27.4478701Z 2025-05-07T19:45:27.4478747Z 2025-05-07T19:45:27.5068555Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:27.5069552Z 2025-05-07T19:45:27.5069566Z 2025-05-07T19:45:27.5069577Z 2025-05-07T19:45:27.5069588Z 2025-05-07T19:45:27.5069598Z 2025-05-07T19:45:27.5069609Z 2025-05-07T19:45:27.5069619Z 2025-05-07T19:45:27.5069661Z 2025-05-07T19:45:27.5069673Z 2025-05-07T19:45:27.5069683Z 2025-05-07T19:45:27.5069693Z 2025-05-07T19:45:27.5069703Z 2025-05-07T19:45:27.5069735Z 2025-05-07T19:45:27.5069746Z 
2025-05-07T19:45:27.5069756Z 2025-05-07T19:45:27.5069767Z 2025-05-07T19:45:27.5071098Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:27.5071403Z 2025-05-07T19:45:27.5071407Z 2025-05-07T19:45:27.5071411Z 2025-05-07T19:45:27.5071415Z 2025-05-07T19:45:27.5071418Z 2025-05-07T19:45:27.5071439Z 2025-05-07T19:45:27.5071443Z 2025-05-07T19:45:27.5071446Z 2025-05-07T19:45:27.5071458Z 2025-05-07T19:45:27.5071462Z 2025-05-07T19:45:27.5071465Z 2025-05-07T19:45:27.5071468Z 2025-05-07T19:45:27.5071471Z 2025-05-07T19:45:27.5071475Z 2025-05-07T19:45:27.5071478Z 2025-05-07T19:45:27.5071481Z 2025-05-07T19:45:27.5326224Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:27.5327220Z 2025-05-07T19:45:27.5327266Z 2025-05-07T19:45:27.5327278Z 2025-05-07T19:45:27.5327288Z 2025-05-07T19:45:27.5327299Z 2025-05-07T19:45:27.5327308Z 2025-05-07T19:45:27.5327318Z 2025-05-07T19:45:27.5327329Z 2025-05-07T19:45:27.5327339Z 2025-05-07T19:45:27.5327349Z 2025-05-07T19:45:27.5327359Z 2025-05-07T19:45:27.5327369Z 2025-05-07T19:45:27.5327379Z 2025-05-07T19:45:27.5327389Z 2025-05-07T19:45:27.5327399Z 2025-05-07T19:45:27.5327409Z 2025-05-07T19:45:27.5327419Z 2025-05-07T19:45:27.5327430Z 2025-05-07T19:45:27.5328382Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:27.5329352Z 2025-05-07T19:45:27.5329363Z 2025-05-07T19:45:27.5329373Z 2025-05-07T19:45:27.5329383Z 2025-05-07T19:45:27.5329393Z 2025-05-07T19:45:27.5329403Z 2025-05-07T19:45:27.5329413Z 2025-05-07T19:45:27.5329423Z 2025-05-07T19:45:27.5329433Z 2025-05-07T19:45:27.5329465Z 2025-05-07T19:45:27.5329475Z 2025-05-07T19:45:27.5329485Z 2025-05-07T19:45:27.5329918Z 2025-05-07T19:45:27.5329932Z 2025-05-07T19:45:27.5329942Z 2025-05-07T19:45:27.5329952Z 2025-05-07T19:45:27.5329963Z 2025-05-07T19:45:27.5329973Z 2025-05-07T19:45:27.9768847Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:27.9769217Z 2025-05-07T19:45:27.9769221Z 2025-05-07T19:45:27.9769225Z 2025-05-07T19:45:28.1584023Z cmake-4.0.2 | 19.4 MB 
| ########## | 100%  2025-05-07T19:45:28.1584851Z 2025-05-07T19:45:28.1584865Z 2025-05-07T19:45:28.3808033Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:28.7660522Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:28.7660824Z 2025-05-07T19:45:28.7660829Z 2025-05-07T19:45:28.7660833Z 2025-05-07T19:45:28.7660836Z 2025-05-07T19:45:28.7660840Z 2025-05-07T19:45:28.7660843Z 2025-05-07T19:45:28.7660847Z 2025-05-07T19:45:28.7660851Z 2025-05-07T19:45:28.7660873Z 2025-05-07T19:45:28.7660876Z 2025-05-07T19:45:28.7660897Z 2025-05-07T19:45:28.7660901Z 2025-05-07T19:45:28.7660905Z 2025-05-07T19:45:28.7660908Z 2025-05-07T19:45:28.7660911Z 2025-05-07T19:45:28.7660915Z 2025-05-07T19:45:28.7660918Z 2025-05-07T19:45:28.7660922Z 2025-05-07T19:45:28.7660925Z 2025-05-07T19:45:28.7661266Z ... (more hidden) ... 2025-05-07T19:45:28.7661579Z 2025-05-07T19:45:28.7661583Z 2025-05-07T19:45:28.7661587Z 2025-05-07T19:45:28.7661590Z 2025-05-07T19:45:28.7661594Z 2025-05-07T19:45:28.7661597Z 2025-05-07T19:45:28.7661600Z 2025-05-07T19:45:28.7661604Z 2025-05-07T19:45:28.7661615Z 2025-05-07T19:45:28.7661619Z 2025-05-07T19:45:28.7661622Z 2025-05-07T19:45:28.7661626Z 2025-05-07T19:45:28.7661629Z 2025-05-07T19:45:28.7661633Z 2025-05-07T19:45:28.7661636Z 2025-05-07T19:45:28.7661639Z 2025-05-07T19:45:28.7661643Z 2025-05-07T19:45:28.7661647Z 2025-05-07T19:45:28.7661650Z 2025-05-07T19:45:29.5759448Z ... (more hidden) ... 
2025-05-07T19:45:29.5760479Z 2025-05-07T19:45:30.2892327Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:30.2896929Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:30.2897711Z 2025-05-07T19:45:30.2897725Z 2025-05-07T19:45:30.2897737Z 2025-05-07T19:45:30.2897747Z 2025-05-07T19:45:30.2897757Z 2025-05-07T19:45:30.2897768Z 2025-05-07T19:45:30.2897778Z 2025-05-07T19:45:30.2897788Z 2025-05-07T19:45:30.2897798Z 2025-05-07T19:45:30.2897809Z 2025-05-07T19:45:30.2897822Z 2025-05-07T19:45:30.2897832Z 2025-05-07T19:45:30.2897842Z 2025-05-07T19:45:30.2897889Z 2025-05-07T19:45:30.2897900Z 2025-05-07T19:45:30.2897935Z 2025-05-07T19:45:30.2897946Z 2025-05-07T19:45:30.2897956Z 2025-05-07T19:45:30.2897966Z 2025-05-07T19:45:30.2898206Z 2025-05-07T19:45:30.2899163Z  2025-05-07T19:45:30.2900135Z 2025-05-07T19:45:30.2900740Z 2025-05-07T19:45:30.2901248Z  2025-05-07T19:45:30.2901849Z 2025-05-07T19:45:30.2901861Z 2025-05-07T19:45:30.2902345Z  2025-05-07T19:45:30.2902985Z 2025-05-07T19:45:30.2902997Z 2025-05-07T19:45:30.2903008Z 2025-05-07T19:45:30.2903473Z  2025-05-07T19:45:30.2903687Z 2025-05-07T19:45:30.2903691Z 2025-05-07T19:45:30.2903695Z 2025-05-07T19:45:30.2903698Z 2025-05-07T19:45:30.2903950Z  2025-05-07T19:45:30.2904195Z 2025-05-07T19:45:30.2904199Z 2025-05-07T19:45:30.2904202Z 2025-05-07T19:45:30.2904205Z 2025-05-07T19:45:30.2904209Z 2025-05-07T19:45:30.2904388Z  2025-05-07T19:45:30.2904614Z 2025-05-07T19:45:30.2904618Z 2025-05-07T19:45:30.2904621Z 2025-05-07T19:45:30.2904643Z 2025-05-07T19:45:30.2904938Z 2025-05-07T19:45:30.2904942Z 2025-05-07T19:45:30.2905134Z  2025-05-07T19:45:30.2905361Z 2025-05-07T19:45:30.2905364Z 2025-05-07T19:45:30.2905368Z 2025-05-07T19:45:30.2905372Z 2025-05-07T19:45:30.2905376Z 2025-05-07T19:45:30.2905379Z 2025-05-07T19:45:30.2905383Z 2025-05-07T19:45:30.2905588Z  2025-05-07T19:45:30.2905814Z 2025-05-07T19:45:30.2905817Z 2025-05-07T19:45:30.2905821Z 2025-05-07T19:45:30.2905825Z 2025-05-07T19:45:30.2905828Z 
2025-05-07T19:45:30.2914335Z 2025-05-07T19:45:30.2914338Z 2025-05-07T19:45:30.2914342Z 2025-05-07T19:45:30.2914345Z 2025-05-07T19:45:30.2914367Z 2025-05-07T19:45:30.2914371Z 2025-05-07T19:45:30.2914374Z 2025-05-07T19:45:30.2914378Z 2025-05-07T19:45:30.2914381Z 2025-05-07T19:45:30.2914385Z 2025-05-07T19:45:30.2914577Z  2025-05-07T19:45:30.2914748Z 2025-05-07T19:45:30.2914752Z 2025-05-07T19:45:30.2914755Z 2025-05-07T19:45:30.2914758Z 2025-05-07T19:45:30.2914780Z 2025-05-07T19:45:30.2914784Z 2025-05-07T19:45:30.2914787Z 2025-05-07T19:45:30.2914791Z 2025-05-07T19:45:30.2914794Z 2025-05-07T19:45:30.2914797Z 2025-05-07T19:45:30.2914801Z 2025-05-07T19:45:30.2914934Z  2025-05-07T19:45:30.2915144Z 2025-05-07T19:45:30.2915147Z 2025-05-07T19:45:30.2915151Z 2025-05-07T19:45:30.2915172Z 2025-05-07T19:45:30.2915176Z 2025-05-07T19:45:30.2915179Z 2025-05-07T19:45:30.2915244Z 2025-05-07T19:45:30.2915247Z 2025-05-07T19:45:30.2915251Z 2025-05-07T19:45:30.2915254Z 2025-05-07T19:45:30.2915258Z 2025-05-07T19:45:30.2915261Z 2025-05-07T19:45:30.2915399Z  2025-05-07T19:45:30.2915586Z 2025-05-07T19:45:30.2915590Z 2025-05-07T19:45:30.2915611Z 2025-05-07T19:45:30.2915614Z 2025-05-07T19:45:30.2915618Z 2025-05-07T19:45:30.2915625Z 2025-05-07T19:45:30.2915628Z 2025-05-07T19:45:30.2915632Z 2025-05-07T19:45:30.2915636Z 2025-05-07T19:45:30.2915639Z 2025-05-07T19:45:30.2915642Z 2025-05-07T19:45:30.2915646Z 2025-05-07T19:45:30.2915649Z 2025-05-07T19:45:30.2915793Z  2025-05-07T19:45:30.2916007Z 2025-05-07T19:45:30.2916010Z 2025-05-07T19:45:30.2916014Z 2025-05-07T19:45:30.2916017Z 2025-05-07T19:45:30.2916021Z 2025-05-07T19:45:30.2916024Z 2025-05-07T19:45:30.2916028Z 2025-05-07T19:45:30.2916031Z 2025-05-07T19:45:30.2916035Z 2025-05-07T19:45:30.2916038Z 2025-05-07T19:45:30.2916046Z 2025-05-07T19:45:30.2916049Z 2025-05-07T19:45:30.2916053Z 2025-05-07T19:45:30.2916056Z 2025-05-07T19:45:30.2916203Z  2025-05-07T19:45:30.2916435Z 2025-05-07T19:45:30.2916439Z 2025-05-07T19:45:30.2916442Z 
2025-05-07T19:45:30.2916445Z 2025-05-07T19:45:30.2916449Z 2025-05-07T19:45:30.2916452Z 2025-05-07T19:45:30.2916456Z 2025-05-07T19:45:30.2916462Z 2025-05-07T19:45:30.2916466Z 2025-05-07T19:45:30.2916470Z 2025-05-07T19:45:30.2916473Z 2025-05-07T19:45:30.2916476Z 2025-05-07T19:45:30.2916480Z 2025-05-07T19:45:30.2916483Z 2025-05-07T19:45:30.2916486Z 2025-05-07T19:45:30.2916661Z  2025-05-07T19:45:30.2916871Z 2025-05-07T19:45:30.2916875Z 2025-05-07T19:45:30.2916879Z 2025-05-07T19:45:30.2916883Z 2025-05-07T19:45:30.2916887Z 2025-05-07T19:45:30.2916890Z 2025-05-07T19:45:30.2916893Z 2025-05-07T19:45:30.2916897Z 2025-05-07T19:45:30.2916900Z 2025-05-07T19:45:30.2916903Z 2025-05-07T19:45:30.2916907Z 2025-05-07T19:45:30.2916914Z 2025-05-07T19:45:30.2916917Z 2025-05-07T19:45:30.2916921Z 2025-05-07T19:45:30.2916924Z 2025-05-07T19:45:30.2916948Z 2025-05-07T19:45:30.2917109Z  2025-05-07T19:45:30.2917322Z 2025-05-07T19:45:30.2917326Z 2025-05-07T19:45:30.2917330Z 2025-05-07T19:45:30.2917333Z 2025-05-07T19:45:30.2917337Z 2025-05-07T19:45:30.2917343Z 2025-05-07T19:45:30.2917346Z 2025-05-07T19:45:30.2917349Z 2025-05-07T19:45:30.2917353Z 2025-05-07T19:45:30.2917356Z 2025-05-07T19:45:30.2917380Z 2025-05-07T19:45:30.2917383Z 2025-05-07T19:45:30.2917386Z 2025-05-07T19:45:30.2917390Z 2025-05-07T19:45:30.2917393Z 2025-05-07T19:45:30.2917396Z 2025-05-07T19:45:30.2917400Z 2025-05-07T19:45:30.2917567Z  2025-05-07T19:45:30.2917783Z 2025-05-07T19:45:30.2917787Z 2025-05-07T19:45:30.2917791Z 2025-05-07T19:45:30.2917821Z 2025-05-07T19:45:30.2917825Z 2025-05-07T19:45:30.2917828Z 2025-05-07T19:45:30.2917835Z 2025-05-07T19:45:30.2917838Z 2025-05-07T19:45:30.2917842Z 2025-05-07T19:45:30.2917845Z 2025-05-07T19:45:30.2917849Z 2025-05-07T19:45:30.2917852Z 2025-05-07T19:45:30.2917856Z 2025-05-07T19:45:30.2917860Z 2025-05-07T19:45:30.2917864Z 2025-05-07T19:45:30.2917868Z 2025-05-07T19:45:30.2917871Z 2025-05-07T19:45:30.2917874Z 2025-05-07T19:45:30.2918120Z  2025-05-07T19:45:30.2918364Z 
2025-05-07T19:45:30.2918367Z 2025-05-07T19:45:30.2918469Z  2025-05-07T19:45:30.2918583Z 2025-05-07T19:45:30.2918586Z 2025-05-07T19:45:30.2918711Z  2025-05-07T19:45:30.2918828Z 2025-05-07T19:45:30.2918832Z 2025-05-07T19:45:30.2918835Z 2025-05-07T19:45:30.2918942Z  2025-05-07T19:45:30.2919079Z 2025-05-07T19:45:30.2919082Z 2025-05-07T19:45:30.2919086Z 2025-05-07T19:45:30.2919090Z 2025-05-07T19:45:30.2919199Z  2025-05-07T19:45:30.2919323Z 2025-05-07T19:45:30.2919326Z 2025-05-07T19:45:30.2919330Z 2025-05-07T19:45:30.2919334Z 2025-05-07T19:45:30.2919400Z 2025-05-07T19:45:30.2919540Z  2025-05-07T19:45:30.2919673Z 2025-05-07T19:45:30.2919676Z 2025-05-07T19:45:30.2919680Z 2025-05-07T19:45:30.2919683Z 2025-05-07T19:45:30.2919687Z 2025-05-07T19:45:30.2919690Z 2025-05-07T19:45:30.2919825Z  2025-05-07T19:45:30.2919962Z 2025-05-07T19:45:30.2919965Z 2025-05-07T19:45:30.2919969Z 2025-05-07T19:45:30.2919977Z 2025-05-07T19:45:30.2919980Z 2025-05-07T19:45:30.2919984Z 2025-05-07T19:45:30.2919987Z 2025-05-07T19:45:30.2920107Z  2025-05-07T19:45:30.2920274Z 2025-05-07T19:45:30.2920278Z 2025-05-07T19:45:30.2920281Z 2025-05-07T19:45:30.2920285Z 2025-05-07T19:45:30.2920288Z 2025-05-07T19:45:30.2920293Z 2025-05-07T19:45:30.2920297Z 2025-05-07T19:45:30.2920300Z 2025-05-07T19:45:30.2920423Z  2025-05-07T19:45:30.2920599Z 2025-05-07T19:45:30.2920603Z 2025-05-07T19:45:30.2920606Z 2025-05-07T19:45:30.2920610Z 2025-05-07T19:45:30.2920613Z 2025-05-07T19:45:30.2920620Z 2025-05-07T19:45:30.2920623Z 2025-05-07T19:45:30.2920628Z 2025-05-07T19:45:30.2920631Z 2025-05-07T19:45:30.2920756Z  2025-05-07T19:45:30.2920925Z 2025-05-07T19:45:30.2920929Z 2025-05-07T19:45:30.2920953Z 2025-05-07T19:45:30.2920956Z 2025-05-07T19:45:30.2920960Z 2025-05-07T19:45:30.2920963Z 2025-05-07T19:45:30.2920967Z 2025-05-07T19:45:30.2920970Z 2025-05-07T19:45:30.2920976Z 2025-05-07T19:45:30.2920981Z 2025-05-07T19:45:30.2921114Z  2025-05-07T19:45:30.2921288Z 2025-05-07T19:45:30.2921292Z 2025-05-07T19:45:30.2921296Z 
2025-05-07T19:45:30.2921320Z 2025-05-07T19:45:30.2921323Z 2025-05-07T19:45:30.2921326Z 2025-05-07T19:45:30.2921330Z 2025-05-07T19:45:30.2921333Z 2025-05-07T19:45:30.2921337Z 2025-05-07T19:45:30.2921340Z 2025-05-07T19:45:30.2921344Z 2025-05-07T19:45:30.2921482Z  2025-05-07T19:45:30.2921664Z 2025-05-07T19:45:30.2921668Z 2025-05-07T19:45:30.2921673Z 2025-05-07T19:45:30.2921699Z 2025-05-07T19:45:30.2921703Z 2025-05-07T19:45:30.2921706Z 2025-05-07T19:45:30.2921709Z 2025-05-07T19:45:30.2921713Z 2025-05-07T19:45:30.2921716Z 2025-05-07T19:45:30.2921720Z 2025-05-07T19:45:30.2921724Z 2025-05-07T19:45:30.2921727Z 2025-05-07T19:45:30.2921869Z  2025-05-07T19:45:30.2922058Z 2025-05-07T19:45:30.2922082Z 2025-05-07T19:45:30.2922089Z 2025-05-07T19:45:30.2922094Z 2025-05-07T19:45:30.2922097Z 2025-05-07T19:45:30.2922100Z 2025-05-07T19:45:30.2922104Z 2025-05-07T19:45:30.2922108Z 2025-05-07T19:45:30.2922112Z 2025-05-07T19:45:30.2922116Z 2025-05-07T19:45:30.2922119Z 2025-05-07T19:45:30.2922123Z 2025-05-07T19:45:30.2922127Z 2025-05-07T19:45:30.2922270Z  2025-05-07T19:45:30.2922490Z 2025-05-07T19:45:30.2922493Z 2025-05-07T19:45:30.2922497Z 2025-05-07T19:45:30.2922500Z 2025-05-07T19:45:30.2922504Z 2025-05-07T19:45:30.2922620Z 2025-05-07T19:45:30.2922625Z 2025-05-07T19:45:30.2922628Z 2025-05-07T19:45:30.2922635Z 2025-05-07T19:45:30.2922639Z 2025-05-07T19:45:30.2922642Z 2025-05-07T19:45:30.2922645Z 2025-05-07T19:45:30.2922649Z 2025-05-07T19:45:30.2922652Z 2025-05-07T19:45:30.2922796Z  2025-05-07T19:45:30.2923022Z 2025-05-07T19:45:30.2923026Z 2025-05-07T19:45:30.2923030Z 2025-05-07T19:45:30.2923033Z 2025-05-07T19:45:30.2923037Z 2025-05-07T19:45:30.2924386Z 2025-05-07T19:45:30.2924392Z 2025-05-07T19:45:30.2924396Z 2025-05-07T19:45:30.2924401Z 2025-05-07T19:45:30.2924404Z 2025-05-07T19:45:30.2924407Z 2025-05-07T19:45:30.2924411Z 2025-05-07T19:45:30.2924414Z 2025-05-07T19:45:30.2924418Z 2025-05-07T19:45:30.2924422Z 2025-05-07T19:45:30.2924599Z  2025-05-07T19:45:30.2924808Z 
2025-05-07T19:45:30.2924814Z 2025-05-07T19:45:30.2924817Z 2025-05-07T19:45:30.2924821Z 2025-05-07T19:45:30.2924824Z 2025-05-07T19:45:30.2924827Z 2025-05-07T19:45:30.2924831Z 2025-05-07T19:45:30.2924834Z 2025-05-07T19:45:30.2924902Z 2025-05-07T19:45:30.2924906Z 2025-05-07T19:45:30.2924909Z 2025-05-07T19:45:30.2924912Z 2025-05-07T19:45:30.2924916Z 2025-05-07T19:45:30.2924919Z 2025-05-07T19:45:30.2924940Z 2025-05-07T19:45:30.2924943Z 2025-05-07T19:45:30.2925099Z  2025-05-07T19:45:30.2925311Z 2025-05-07T19:45:30.2925314Z 2025-05-07T19:45:30.2925318Z 2025-05-07T19:45:30.2925325Z 2025-05-07T19:45:30.2925329Z 2025-05-07T19:45:30.2925332Z 2025-05-07T19:45:30.2925335Z 2025-05-07T19:45:30.2925339Z 2025-05-07T19:45:30.2925342Z 2025-05-07T19:45:30.2925363Z 2025-05-07T19:45:30.2925366Z 2025-05-07T19:45:30.2925370Z 2025-05-07T19:45:30.2925373Z 2025-05-07T19:45:30.2925376Z 2025-05-07T19:45:30.2925380Z 2025-05-07T19:45:30.2925383Z 2025-05-07T19:45:30.2925387Z 2025-05-07T19:45:30.2925546Z  2025-05-07T19:45:30.2925770Z 2025-05-07T19:45:30.2925774Z 2025-05-07T19:45:30.2925798Z 2025-05-07T19:45:30.2925802Z 2025-05-07T19:45:30.2925808Z 2025-05-07T19:45:30.2925812Z 2025-05-07T19:45:30.2925815Z 2025-05-07T19:45:30.2925819Z 2025-05-07T19:45:30.2925823Z 2025-05-07T19:45:30.2925826Z 2025-05-07T19:45:30.2925829Z 2025-05-07T19:45:30.2925833Z 2025-05-07T19:45:30.2925836Z 2025-05-07T19:45:30.2925840Z 2025-05-07T19:45:30.2925843Z 2025-05-07T19:45:30.2925846Z 2025-05-07T19:45:30.2925850Z 2025-05-07T19:45:30.2925856Z 2025-05-07T19:45:30.2926025Z  2025-05-07T19:45:30.2926268Z 2025-05-07T19:45:30.2926272Z 2025-05-07T19:45:30.2926372Z  2025-05-07T19:45:30.2926476Z 2025-05-07T19:45:30.2926479Z 2025-05-07T19:45:30.2926600Z  2025-05-07T19:45:30.2926712Z 2025-05-07T19:45:30.2926715Z 2025-05-07T19:45:30.2926719Z 2025-05-07T19:45:30.2926821Z  2025-05-07T19:45:30.2926954Z 2025-05-07T19:45:30.2926958Z 2025-05-07T19:45:30.2926962Z 2025-05-07T19:45:30.2926965Z 2025-05-07T19:45:30.2927072Z  
2025-05-07T19:45:30.2927192Z 2025-05-07T19:45:30.2927200Z 2025-05-07T19:45:30.2927204Z 2025-05-07T19:45:30.2927208Z 2025-05-07T19:45:30.2927212Z 2025-05-07T19:45:30.2927340Z  2025-05-07T19:45:30.2927468Z 2025-05-07T19:45:30.2927472Z 2025-05-07T19:45:30.2927475Z 2025-05-07T19:45:30.2927479Z 2025-05-07T19:45:30.2927482Z 2025-05-07T19:45:30.2927486Z 2025-05-07T19:45:30.2927620Z  2025-05-07T19:45:30.2927757Z 2025-05-07T19:45:30.2927761Z 2025-05-07T19:45:30.2927765Z 2025-05-07T19:45:30.2927768Z 2025-05-07T19:45:30.2927772Z 2025-05-07T19:45:30.2927775Z 2025-05-07T19:45:30.2927779Z 2025-05-07T19:45:30.2927903Z  2025-05-07T19:45:30.2928072Z 2025-05-07T19:45:30.2928075Z 2025-05-07T19:45:30.2928079Z 2025-05-07T19:45:30.2928082Z 2025-05-07T19:45:30.2928086Z 2025-05-07T19:45:30.2928089Z 2025-05-07T19:45:30.2928092Z 2025-05-07T19:45:30.2928096Z 2025-05-07T19:45:30.2928216Z  2025-05-07T19:45:30.2928390Z 2025-05-07T19:45:30.2928394Z 2025-05-07T19:45:30.2928398Z 2025-05-07T19:45:30.2928405Z 2025-05-07T19:45:30.2928408Z 2025-05-07T19:45:30.2928412Z 2025-05-07T19:45:30.2928415Z 2025-05-07T19:45:30.2928418Z 2025-05-07T19:45:30.2928422Z 2025-05-07T19:45:30.2928550Z  2025-05-07T19:45:30.2928714Z 2025-05-07T19:45:30.2928734Z 2025-05-07T19:45:30.2928738Z 2025-05-07T19:45:30.2928741Z 2025-05-07T19:45:30.2928745Z 2025-05-07T19:45:30.2928807Z 2025-05-07T19:45:30.2928812Z 2025-05-07T19:45:30.2928815Z 2025-05-07T19:45:30.2928819Z 2025-05-07T19:45:30.2928822Z 2025-05-07T19:45:30.2928950Z  2025-05-07T19:45:30.2929127Z 2025-05-07T19:45:30.2929131Z 2025-05-07T19:45:30.2929152Z 2025-05-07T19:45:30.2929156Z 2025-05-07T19:45:30.2929159Z 2025-05-07T19:45:30.2929164Z 2025-05-07T19:45:30.2929167Z 2025-05-07T19:45:30.2929171Z 2025-05-07T19:45:30.2929174Z 2025-05-07T19:45:30.2929177Z 2025-05-07T19:45:30.2929181Z 2025-05-07T19:45:30.2929315Z  2025-05-07T19:45:30.2929497Z 2025-05-07T19:45:30.2929578Z 2025-05-07T19:45:30.2929582Z 2025-05-07T19:45:30.2929585Z 2025-05-07T19:45:30.2929588Z 
2025-05-07T19:45:30.2929592Z 2025-05-07T19:45:30.2929595Z 2025-05-07T19:45:30.2929599Z 2025-05-07T19:45:30.2929602Z 2025-05-07T19:45:30.2929606Z 2025-05-07T19:45:30.2929609Z 2025-05-07T19:45:30.2929612Z 2025-05-07T19:45:30.2929750Z  2025-05-07T19:45:30.2929958Z 2025-05-07T19:45:30.2929962Z 2025-05-07T19:45:30.2929966Z 2025-05-07T19:45:30.2929969Z 2025-05-07T19:45:30.2929973Z 2025-05-07T19:45:30.2929976Z 2025-05-07T19:45:30.2929979Z 2025-05-07T19:45:30.2929983Z 2025-05-07T19:45:30.2929986Z 2025-05-07T19:45:30.2929990Z 2025-05-07T19:45:30.2929993Z 2025-05-07T19:45:30.2929997Z 2025-05-07T19:45:30.2930000Z 2025-05-07T19:45:30.2930141Z  2025-05-07T19:45:30.2930354Z 2025-05-07T19:45:30.2930358Z 2025-05-07T19:45:30.2930361Z 2025-05-07T19:45:30.2930365Z 2025-05-07T19:45:30.2930368Z 2025-05-07T19:45:30.2930372Z 2025-05-07T19:45:30.2930379Z 2025-05-07T19:45:30.2930382Z 2025-05-07T19:45:30.2930386Z 2025-05-07T19:45:30.2930389Z 2025-05-07T19:45:30.2930393Z 2025-05-07T19:45:30.2930396Z 2025-05-07T19:45:30.2930400Z 2025-05-07T19:45:30.2930403Z 2025-05-07T19:45:30.2930547Z  2025-05-07T19:45:30.2930766Z 2025-05-07T19:45:30.2930769Z 2025-05-07T19:45:30.2930776Z 2025-05-07T19:45:30.2930779Z 2025-05-07T19:45:30.2930783Z 2025-05-07T19:45:30.2930786Z 2025-05-07T19:45:30.2930789Z 2025-05-07T19:45:30.2930793Z 2025-05-07T19:45:30.2930796Z 2025-05-07T19:45:30.2930800Z 2025-05-07T19:45:30.2930803Z 2025-05-07T19:45:30.2930807Z 2025-05-07T19:45:30.2930810Z 2025-05-07T19:45:30.2930813Z 2025-05-07T19:45:30.2930817Z 2025-05-07T19:45:30.2930983Z  2025-05-07T19:45:30.2931194Z 2025-05-07T19:45:30.2931198Z 2025-05-07T19:45:30.2931201Z 2025-05-07T19:45:30.2931205Z 2025-05-07T19:45:30.2931208Z 2025-05-07T19:45:30.2931212Z 2025-05-07T19:45:30.2931219Z 2025-05-07T19:45:30.2931223Z 2025-05-07T19:45:30.2931226Z 2025-05-07T19:45:30.2931230Z 2025-05-07T19:45:30.2931234Z 2025-05-07T19:45:30.2931238Z 2025-05-07T19:45:30.2931241Z 2025-05-07T19:45:30.2931245Z 2025-05-07T19:45:30.2931267Z 
2025-05-07T19:45:30.2931270Z 2025-05-07T19:45:30.2931423Z  2025-05-07T19:45:30.2931637Z 2025-05-07T19:45:30.2931643Z 2025-05-07T19:45:30.2931647Z 2025-05-07T19:45:30.2931650Z 2025-05-07T19:45:30.2931654Z 2025-05-07T19:45:30.2931657Z 2025-05-07T19:45:30.2931661Z 2025-05-07T19:45:30.2931664Z 2025-05-07T19:45:30.2931667Z 2025-05-07T19:45:30.2931688Z 2025-05-07T19:45:30.2931692Z 2025-05-07T19:45:30.2931695Z 2025-05-07T19:45:30.2931698Z 2025-05-07T19:45:30.2931702Z 2025-05-07T19:45:30.2931705Z 2025-05-07T19:45:30.2931709Z 2025-05-07T19:45:30.2931712Z 2025-05-07T19:45:30.2931874Z  2025-05-07T19:45:30.2932093Z 2025-05-07T19:45:30.2932096Z 2025-05-07T19:45:30.2932121Z 2025-05-07T19:45:30.2932125Z 2025-05-07T19:45:30.2932128Z 2025-05-07T19:45:30.2932131Z 2025-05-07T19:45:30.2932134Z 2025-05-07T19:45:30.2932138Z 2025-05-07T19:45:30.2932141Z 2025-05-07T19:45:30.2932145Z 2025-05-07T19:45:30.2932148Z 2025-05-07T19:45:30.2932151Z 2025-05-07T19:45:30.2932155Z 2025-05-07T19:45:30.2932158Z 2025-05-07T19:45:30.2932161Z 2025-05-07T19:45:30.2932226Z 2025-05-07T19:45:30.2932230Z 2025-05-07T19:45:30.2932234Z 2025-05-07T19:45:30.2932403Z  2025-05-07T19:45:30.2932642Z 2025-05-07T19:45:30.2932646Z 2025-05-07T19:45:30.2932747Z  2025-05-07T19:45:30.2932856Z 2025-05-07T19:45:30.2932860Z 2025-05-07T19:45:30.2932979Z  2025-05-07T19:45:30.2933092Z 2025-05-07T19:45:30.2933096Z 2025-05-07T19:45:30.2933099Z 2025-05-07T19:45:30.2933202Z  2025-05-07T19:45:30.2933336Z 2025-05-07T19:45:30.2933339Z 2025-05-07T19:45:30.2933343Z 2025-05-07T19:45:30.2933347Z 2025-05-07T19:45:30.2933452Z  2025-05-07T19:45:30.2933634Z 2025-05-07T19:45:30.2933637Z 2025-05-07T19:45:30.2933641Z 2025-05-07T19:45:30.2933644Z 2025-05-07T19:45:30.2933648Z 2025-05-07T19:45:30.2933776Z  2025-05-07T19:45:30.2933906Z 2025-05-07T19:45:30.2933910Z 2025-05-07T19:45:30.2933914Z 2025-05-07T19:45:30.2933917Z 2025-05-07T19:45:30.2933920Z 2025-05-07T19:45:30.2933923Z 2025-05-07T19:45:30.2934058Z  2025-05-07T19:45:30.2934191Z 
2025-05-07T19:45:30.2934195Z 2025-05-07T19:45:30.2934198Z 2025-05-07T19:45:30.2934202Z 2025-05-07T19:45:30.2934205Z 2025-05-07T19:45:30.2934209Z 2025-05-07T19:45:30.2934212Z 2025-05-07T19:45:30.2934325Z  2025-05-07T19:45:30.2934487Z 2025-05-07T19:45:30.2934492Z 2025-05-07T19:45:30.2934495Z 2025-05-07T19:45:30.2934498Z 2025-05-07T19:45:30.2934502Z 2025-05-07T19:45:30.2934505Z 2025-05-07T19:45:30.2934509Z 2025-05-07T19:45:30.2934513Z 2025-05-07T19:45:30.2934632Z  2025-05-07T19:45:30.2934804Z 2025-05-07T19:45:30.2934811Z 2025-05-07T19:45:30.2934814Z 2025-05-07T19:45:30.2934818Z 2025-05-07T19:45:30.2934821Z 2025-05-07T19:45:30.2934825Z 2025-05-07T19:45:30.2934828Z 2025-05-07T19:45:30.2934832Z 2025-05-07T19:45:30.2934835Z 2025-05-07T19:45:30.2934958Z  2025-05-07T19:45:30.2935120Z 2025-05-07T19:45:30.2935143Z 2025-05-07T19:45:30.2935146Z 2025-05-07T19:45:30.2935152Z 2025-05-07T19:45:30.2935156Z 2025-05-07T19:45:30.2935159Z 2025-05-07T19:45:30.2935163Z 2025-05-07T19:45:30.2935166Z 2025-05-07T19:45:30.2935170Z 2025-05-07T19:45:30.2935173Z 2025-05-07T19:45:30.2935300Z  2025-05-07T19:45:30.2935470Z 2025-05-07T19:45:30.2935473Z 2025-05-07T19:45:30.2935495Z 2025-05-07T19:45:30.2935498Z 2025-05-07T19:45:30.2935501Z 2025-05-07T19:45:30.2935505Z 2025-05-07T19:45:30.2935508Z 2025-05-07T19:45:30.2935512Z 2025-05-07T19:45:30.2935515Z 2025-05-07T19:45:30.2935518Z 2025-05-07T19:45:30.2935522Z 2025-05-07T19:45:30.2935653Z  2025-05-07T19:45:30.2935832Z 2025-05-07T19:45:30.2935854Z 2025-05-07T19:45:30.2935858Z 2025-05-07T19:45:30.2935861Z 2025-05-07T19:45:30.2935864Z 2025-05-07T19:45:30.2935868Z 2025-05-07T19:45:30.2935871Z 2025-05-07T19:45:30.2935874Z 2025-05-07T19:45:30.2935878Z 2025-05-07T19:45:30.2935881Z 2025-05-07T19:45:30.2935885Z 2025-05-07T19:45:30.2935889Z 2025-05-07T19:45:30.2936025Z  2025-05-07T19:45:30.2936340Z 2025-05-07T19:45:30.2936346Z 2025-05-07T19:45:30.2936352Z 2025-05-07T19:45:30.2936356Z 2025-05-07T19:45:30.2936361Z 2025-05-07T19:45:30.2936367Z 
2025-05-07T19:45:30.2936370Z 2025-05-07T19:45:30.2936374Z 2025-05-07T19:45:30.2936378Z 2025-05-07T19:45:30.2936381Z 2025-05-07T19:45:30.2936384Z 2025-05-07T19:45:30.2936388Z 2025-05-07T19:45:30.2936391Z 2025-05-07T19:45:30.2936544Z  2025-05-07T19:45:30.2936761Z 2025-05-07T19:45:30.2936765Z 2025-05-07T19:45:30.2936768Z 2025-05-07T19:45:30.2936775Z 2025-05-07T19:45:30.2936779Z 2025-05-07T19:45:30.2936782Z 2025-05-07T19:45:30.2936786Z 2025-05-07T19:45:30.2936790Z 2025-05-07T19:45:30.2936794Z 2025-05-07T19:45:30.2936797Z 2025-05-07T19:45:30.2936801Z 2025-05-07T19:45:30.2936804Z 2025-05-07T19:45:30.2936807Z 2025-05-07T19:45:30.2936811Z 2025-05-07T19:45:30.2936956Z  2025-05-07T19:45:30.2937266Z 2025-05-07T19:45:30.2937270Z 2025-05-07T19:45:30.2937274Z 2025-05-07T19:45:30.2937278Z 2025-05-07T19:45:30.2937282Z 2025-05-07T19:45:30.2937285Z 2025-05-07T19:45:30.2937289Z 2025-05-07T19:45:30.2937292Z 2025-05-07T19:45:30.2937296Z 2025-05-07T19:45:30.2937299Z 2025-05-07T19:45:30.2937303Z 2025-05-07T19:45:30.2937306Z 2025-05-07T19:45:30.2937309Z 2025-05-07T19:45:30.2937312Z 2025-05-07T19:45:30.2937317Z 2025-05-07T19:45:30.2937490Z  2025-05-07T19:45:30.2937697Z 2025-05-07T19:45:30.2937702Z 2025-05-07T19:45:30.2937705Z 2025-05-07T19:45:30.2937708Z 2025-05-07T19:45:30.2937775Z 2025-05-07T19:45:30.2937779Z 2025-05-07T19:45:30.2937782Z 2025-05-07T19:45:30.2937785Z 2025-05-07T19:45:30.2937789Z 2025-05-07T19:45:30.2937792Z 2025-05-07T19:45:30.2937796Z 2025-05-07T19:45:30.2937800Z 2025-05-07T19:45:30.2937803Z 2025-05-07T19:45:30.2937807Z 2025-05-07T19:45:30.2937827Z 2025-05-07T19:45:30.2937831Z 2025-05-07T19:45:30.2937991Z  2025-05-07T19:45:30.2938204Z 2025-05-07T19:45:30.2938207Z 2025-05-07T19:45:30.2938211Z 2025-05-07T19:45:30.2938215Z 2025-05-07T19:45:30.2938218Z 2025-05-07T19:45:30.2938222Z 2025-05-07T19:45:30.2938225Z 2025-05-07T19:45:30.2938228Z 2025-05-07T19:45:30.2938252Z 2025-05-07T19:45:30.2938255Z 2025-05-07T19:45:30.2938259Z 2025-05-07T19:45:30.2938262Z 
2025-05-07T19:45:30.2938265Z 2025-05-07T19:45:30.2938269Z 2025-05-07T19:45:30.2938272Z 2025-05-07T19:45:30.2938276Z 2025-05-07T19:45:30.2938279Z 2025-05-07T19:45:30.2938440Z  2025-05-07T19:45:30.2938663Z 2025-05-07T19:45:30.2938667Z 2025-05-07T19:45:30.2938690Z 2025-05-07T19:45:30.2938694Z 2025-05-07T19:45:30.2938697Z 2025-05-07T19:45:30.2938701Z 2025-05-07T19:45:30.2938704Z 2025-05-07T19:45:30.2938707Z 2025-05-07T19:45:30.2938711Z 2025-05-07T19:45:30.2938714Z 2025-05-07T19:45:30.2938718Z 2025-05-07T19:45:30.2938722Z 2025-05-07T19:45:30.2938725Z 2025-05-07T19:45:30.2938732Z 2025-05-07T19:45:30.2938735Z 2025-05-07T19:45:30.2938739Z 2025-05-07T19:45:30.2938742Z 2025-05-07T19:45:30.2938746Z 2025-05-07T19:45:30.2938916Z  2025-05-07T19:45:30.2939159Z 2025-05-07T19:45:30.2939163Z 2025-05-07T19:45:30.2939266Z  2025-05-07T19:45:30.2939378Z 2025-05-07T19:45:30.2939381Z 2025-05-07T19:45:30.2939501Z  2025-05-07T19:45:30.2939613Z 2025-05-07T19:45:30.2939616Z 2025-05-07T19:45:30.2939620Z 2025-05-07T19:45:30.2939744Z  done 2025-05-07T19:45:30.6070857Z Preparing transaction: | / - done 2025-05-07T19:45:34.4833434Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:37.0025099Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:37.4202254Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:39.1006294Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:39.1007002Z 2025-05-07T19:45:39.1019093Z 2025-05-07T19:45:39.1046304Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:41.2734876Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. 
It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:41.2739057Z 2025-05-07T19:45:41.2739183Z Collecting build 2025-05-07T19:45:41.2739543Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:41.2740689Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build) (25.0) 2025-05-07T19:45:41.2741407Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:41.2741863Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:41.2742359Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:41.2742818Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:41.2743377Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:41.2743621Z 2025-05-07T19:45:41.2743807Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:41.2744268Z 2025-05-07T19:45:42.9146215Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:42.9146525Z 2025-05-07T19:45:42.9957864Z [CHECK] Binary make found in PATH 2025-05-07T19:45:44.5944761Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:44.5945422Z 2025-05-07T19:45:44.6536573Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:46.2382522Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:46.2382840Z 2025-05-07T19:45:46.2961487Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:47.9823271Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:49.7986984Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:51.5076527Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:53.3117981Z [CHECK] Python (sub-)package 'skbuild' found ... 
2025-05-07T19:45:54.9715547Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:54.9716724Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:54.9779108Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.9779603Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.9780225Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:54.9780594Z env: 2025-05-07T19:45:54.9780842Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:54.9781182Z BUILD_ENV: build_binary 2025-05-07T19:45:54.9781446Z BUILD_TARGET: default 2025-05-07T19:45:54.9781712Z BUILD_VARIANT: cuda 2025-05-07T19:45:54.9781958Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:54.9782249Z ##[endgroup] 2025-05-07T19:45:55.4536932Z ################################################################################ 2025-05-07T19:45:55.4537933Z # Install CUDA 2025-05-07T19:45:55.4538537Z # 2025-05-07T19:45:55.4549536Z # [2025-05-07T19:45:55.454Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:55.4550529Z ################################################################################ 2025-05-07T19:45:55.4550872Z 2025-05-07T19:45:55.4567099Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:55.5444091Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:55.5445157Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:55.5446096Z + conda clean --packages --tarball -y 2025-05-07T19:45:55.5446694Z 2025-05-07T19:45:56.0677365Z Will remove 144 (595.4 MB) tarball(s). 2025-05-07T19:45:56.0678001Z Will remove 19 (4.7 MB) package(s). 2025-05-07T19:45:56.1240748Z 2025-05-07T19:45:56.1245525Z + conda clean --all -y 2025-05-07T19:45:56.1246039Z 2025-05-07T19:45:56.7375338Z There are no unused tarball(s) to remove. 2025-05-07T19:45:56.7376338Z Will remove 1 index cache(s). 2025-05-07T19:45:56.7377400Z There are no unused package(s) to remove. 
2025-05-07T19:45:56.7378341Z There are no tempfile(s) to remove. 2025-05-07T19:45:56.7378641Z There are no logfile(s) to remove. 2025-05-07T19:45:56.7929853Z 2025-05-07T19:45:56.7940201Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:56.7969041Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:57.6232714Z Channels: 2025-05-07T19:45:57.6233135Z - conda-forge 2025-05-07T19:45:57.6233519Z Platform: linux-64 2025-05-07T19:46:07.3629049Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:46:08.8720615Z Solving environment: | / - \ done 2025-05-07T19:46:09.0114131Z 2025-05-07T19:46:09.0114547Z ## Package Plan ## 2025-05-07T19:46:09.0115067Z 2025-05-07T19:46:09.0115660Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:09.0116592Z 2025-05-07T19:46:09.0116867Z added / updated specs: 2025-05-07T19:46:09.0117580Z - cuda=12.6.3 2025-05-07T19:46:09.0117956Z 2025-05-07T19:46:09.0117968Z 2025-05-07T19:46:09.0118310Z The following packages will be downloaded: 2025-05-07T19:46:09.0118975Z 2025-05-07T19:46:09.0119340Z package | build 2025-05-07T19:46:09.0120673Z ---------------------------|----------------- 2025-05-07T19:46:09.0121064Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:46:09.0121505Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:46:09.0122058Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:46:09.0122489Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:46:09.0122920Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:46:09.0123443Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:46:09.0123944Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:46:09.0124439Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:46:09.0125136Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB 
conda-forge 2025-05-07T19:46:09.0125604Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:09.0126084Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:09.0126687Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:46:09.0127199Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:09.0127719Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:46:09.0128211Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:46:09.0128684Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:46:09.0129203Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:46:09.0129634Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:46:09.0130092Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:46:09.0130553Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:09.0131028Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:46:09.0131494Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:46:09.0131915Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:09.0132385Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:09.0132832Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:46:09.0133264Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:46:09.0133723Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:46:09.0134200Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:46:09.0134692Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:46:09.0135155Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:46:09.0135644Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:46:09.0136130Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:46:09.0136714Z 
cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:46:09.0137388Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:46:09.0137907Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:46:09.0138420Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:46:09.0138889Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:46:09.0139494Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:46:09.0140040Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:46:09.0140534Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:46:09.0141018Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:46:09.0141470Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:46:09.0141969Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:46:09.0142470Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:46:09.0142976Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:09.0143572Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:46:09.0144143Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:09.0144594Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:09.0145015Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:46:09.0145485Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:09.0145951Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:46:09.0146354Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:46:09.0146743Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:09.0147132Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:46:09.0147538Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:46:09.0147910Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:09.0148304Z libcap-2.75 | h39aace5_0 118 KB conda-forge 
2025-05-07T19:46:09.0148724Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:46:09.0149152Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:46:09.0149596Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:46:09.0150018Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:46:09.0150462Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:46:09.0150892Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:46:09.0151330Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:46:09.0151779Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:46:09.0152216Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:46:09.0152676Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:46:09.0153126Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:46:09.0153603Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:46:09.0154085Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:46:09.0173688Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:46:09.0174189Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:46:09.0174625Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:46:09.0175096Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:46:09.0175544Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:46:09.0176032Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:46:09.0176900Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:46:09.0177385Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:46:09.0177892Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:46:09.0178368Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:46:09.0178862Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:46:09.0179356Z 
libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:46:09.0179808Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:46:09.0180279Z libxkbcommon-1.7.0 | h2c5496b_1 579 KB conda-forge 2025-05-07T19:46:09.0180874Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:46:09.0181325Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:46:09.0181787Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:46:09.0182261Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:46:09.0182678Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:46:09.0183089Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:46:09.0183568Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:46:09.0184038Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:46:09.0184488Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:46:09.0184912Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:46:09.0185384Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:46:09.0185871Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:46:09.0186342Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:46:09.0186853Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:46:09.0187324Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:46:09.0187805Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:46:09.0188320Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:46:09.0188814Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:46:09.0189369Z ------------------------------------------------------------ 2025-05-07T19:46:09.0189702Z Total: 1.59 GB 2025-05-07T19:46:09.0189928Z 2025-05-07T19:46:09.0190059Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:09.0190273Z 2025-05-07T19:46:09.0190445Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 
2025-05-07T19:46:09.0190868Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:46:09.0191332Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:46:09.0191746Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:46:09.0192214Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:46:09.0192791Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:46:09.0193369Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:46:09.0193920Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:09.0194459Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:46:09.0195051Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:46:09.0195553Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:46:09.0196133Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:09.0196738Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:46:09.0197339Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:09.0197945Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:09.0198490Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0199002Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:46:09.0199507Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:46:09.0200127Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0200670Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:46:09.0201229Z cuda-driver-dev_l~ 
conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:09.0201760Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:46:09.0202255Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:46:09.0202804Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:46:09.0203357Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:46:09.0203823Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:46:09.0204349Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:46:09.0204913Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:46:09.0205434Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:46:09.0205985Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:46:09.0206511Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0207211Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0207741Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:46:09.0208272Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0208978Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:46:09.0209498Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:46:09.0210029Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:46:09.0210593Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:09.0211180Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:46:09.0211756Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:46:09.0212279Z cuda-nvvp 
conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:46:09.0212789Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:46:09.0213347Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:46:09.0213933Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:46:09.0214514Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:46:09.0215084Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:46:09.0215672Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:46:09.0216262Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:46:09.0216851Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:46:09.0217418Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:46:09.0217995Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:46:09.0218483Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:46:09.0218908Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:46:09.0219348Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:46:09.0219809Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:46:09.0220202Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:46:09.0220634Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:46:09.0221200Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:46:09.0221731Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:46:09.0222263Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:46:09.0222757Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:46:09.0223289Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 
2025-05-07T19:46:09.0223828Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:46:09.0224351Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:46:09.0224898Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:46:09.0225441Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:46:09.0226021Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:46:09.0226593Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:46:09.0227147Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:46:09.0227722Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:46:09.0228242Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:46:09.0228735Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:46:09.0229192Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:46:09.0229677Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:46:09.0230181Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:46:09.0230668Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:46:09.0231236Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:46:09.0231814Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:46:09.0232384Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:46:09.0232950Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:46:09.0233484Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:46:09.0234035Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:46:09.0234536Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 
2025-05-07T19:46:09.0235027Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.7.0-h2c5496b_1 2025-05-07T19:46:09.0235553Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:46:09.0236003Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:46:09.0236522Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:46:09.0237111Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:46:09.0237504Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:46:09.0237933Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:46:09.0238449Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:46:09.0238992Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:46:09.0239463Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:46:09.0239909Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:46:09.0240430Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:46:09.0240976Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:46:09.0241621Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:46:09.0242219Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:46:09.0242792Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:46:09.0243340Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:46:09.0243942Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:46:09.0244557Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:46:09.0244893Z 2025-05-07T19:46:09.0244942Z 2025-05-07T19:46:09.0244946Z 2025-05-07T19:46:09.0245118Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:09.0245799Z nsight-compute-2024. 
| 443.1 MB | | 0% 2025-05-07T19:46:09.0246198Z 2025-05-07T19:46:09.0246519Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:09.0246766Z 2025-05-07T19:46:09.0246770Z 2025-05-07T19:46:09.0247182Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:46:09.0247447Z 2025-05-07T19:46:09.0247451Z 2025-05-07T19:46:09.0247454Z 2025-05-07T19:46:09.0247695Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:09.0247989Z 2025-05-07T19:46:09.0247993Z 2025-05-07T19:46:09.0247996Z 2025-05-07T19:46:09.0248000Z 2025-05-07T19:46:09.0254312Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:09.0255156Z 2025-05-07T19:46:09.0255167Z 2025-05-07T19:46:09.0255178Z 2025-05-07T19:46:09.0255213Z 2025-05-07T19:46:09.0255223Z 2025-05-07T19:46:09.0255921Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:09.0256923Z 2025-05-07T19:46:09.0256934Z 2025-05-07T19:46:09.0256945Z 2025-05-07T19:46:09.0256955Z 2025-05-07T19:46:09.0256965Z 2025-05-07T19:46:09.0256976Z 2025-05-07T19:46:09.0257755Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:09.0258594Z 2025-05-07T19:46:09.0258623Z 2025-05-07T19:46:09.0258646Z 2025-05-07T19:46:09.0258657Z 2025-05-07T19:46:09.0258666Z 2025-05-07T19:46:09.0258676Z 2025-05-07T19:46:09.0258686Z 2025-05-07T19:46:09.0259428Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:09.0259917Z 2025-05-07T19:46:09.0259921Z 2025-05-07T19:46:09.0259925Z 2025-05-07T19:46:09.0259928Z 2025-05-07T19:46:09.0259932Z 2025-05-07T19:46:09.0259935Z 2025-05-07T19:46:09.0259939Z 2025-05-07T19:46:09.0259942Z 2025-05-07T19:46:09.0260224Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:09.0260521Z 2025-05-07T19:46:09.0260525Z 2025-05-07T19:46:09.0260529Z 2025-05-07T19:46:09.0260532Z 2025-05-07T19:46:09.0260536Z 2025-05-07T19:46:09.0260539Z 2025-05-07T19:46:09.0260543Z 2025-05-07T19:46:09.0260546Z 2025-05-07T19:46:09.0260550Z 2025-05-07T19:46:09.0262357Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:09.0262653Z 2025-05-07T19:46:09.0264026Z 
2025-05-07T19:46:09.0264030Z 2025-05-07T19:46:09.0264033Z 2025-05-07T19:46:09.0264036Z 2025-05-07T19:46:09.0264040Z 2025-05-07T19:46:09.0264043Z 2025-05-07T19:46:09.0264047Z 2025-05-07T19:46:09.0264050Z 2025-05-07T19:46:09.0264053Z 2025-05-07T19:46:09.0264352Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:09.0264644Z 2025-05-07T19:46:09.0264648Z 2025-05-07T19:46:09.0264652Z 2025-05-07T19:46:09.0264655Z 2025-05-07T19:46:09.0264660Z 2025-05-07T19:46:09.0264663Z 2025-05-07T19:46:09.0264666Z 2025-05-07T19:46:09.0264670Z 2025-05-07T19:46:09.0264673Z 2025-05-07T19:46:09.0264676Z 2025-05-07T19:46:09.0264680Z 2025-05-07T19:46:09.0269470Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:09.0269797Z 2025-05-07T19:46:09.0269801Z 2025-05-07T19:46:09.0269805Z 2025-05-07T19:46:09.0269808Z 2025-05-07T19:46:09.0269811Z 2025-05-07T19:46:09.0269816Z 2025-05-07T19:46:09.0269922Z 2025-05-07T19:46:09.0269931Z 2025-05-07T19:46:09.0269935Z 2025-05-07T19:46:09.0269954Z 2025-05-07T19:46:09.0269958Z 2025-05-07T19:46:09.0269961Z 2025-05-07T19:46:09.0270458Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:09.0270758Z 2025-05-07T19:46:09.0270780Z 2025-05-07T19:46:09.0270783Z 2025-05-07T19:46:09.0270787Z 2025-05-07T19:46:09.0270809Z 2025-05-07T19:46:09.0270812Z 2025-05-07T19:46:09.0270815Z 2025-05-07T19:46:09.0270819Z 2025-05-07T19:46:09.0270822Z 2025-05-07T19:46:09.0270825Z 2025-05-07T19:46:09.0270830Z 2025-05-07T19:46:09.0270833Z 2025-05-07T19:46:09.0271775Z 2025-05-07T19:46:09.0272083Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:09.0272425Z 2025-05-07T19:46:09.0272428Z 2025-05-07T19:46:09.0272432Z 2025-05-07T19:46:09.0272435Z 2025-05-07T19:46:09.0272438Z 2025-05-07T19:46:09.0272442Z 2025-05-07T19:46:09.0272445Z 2025-05-07T19:46:09.0272454Z 2025-05-07T19:46:09.0272463Z 2025-05-07T19:46:09.0272468Z 2025-05-07T19:46:09.0272471Z 2025-05-07T19:46:09.0272475Z 2025-05-07T19:46:09.0272478Z 2025-05-07T19:46:09.0272482Z 2025-05-07T19:46:09.0273375Z cuda-nvcc-dev_linux- 
| 10.8 MB | | 0%  2025-05-07T19:46:09.0273733Z 2025-05-07T19:46:09.0273738Z 2025-05-07T19:46:09.0273742Z 2025-05-07T19:46:09.0273746Z 2025-05-07T19:46:09.0273749Z 2025-05-07T19:46:09.0273753Z 2025-05-07T19:46:09.0273757Z 2025-05-07T19:46:09.0273778Z 2025-05-07T19:46:09.0273800Z 2025-05-07T19:46:09.0273804Z 2025-05-07T19:46:09.0273808Z 2025-05-07T19:46:09.0273813Z 2025-05-07T19:46:09.0273817Z 2025-05-07T19:46:09.0273822Z 2025-05-07T19:46:09.0273826Z 2025-05-07T19:46:09.0278499Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:09.0278831Z 2025-05-07T19:46:09.0278854Z 2025-05-07T19:46:09.0278858Z 2025-05-07T19:46:09.0278876Z 2025-05-07T19:46:09.0278906Z 2025-05-07T19:46:09.0278921Z 2025-05-07T19:46:09.0278925Z 2025-05-07T19:46:09.0278928Z 2025-05-07T19:46:09.0278932Z 2025-05-07T19:46:09.0278936Z 2025-05-07T19:46:09.0278940Z 2025-05-07T19:46:09.0278943Z 2025-05-07T19:46:09.0278946Z 2025-05-07T19:46:09.0278950Z 2025-05-07T19:46:09.0278953Z 2025-05-07T19:46:09.0278956Z 2025-05-07T19:46:09.0279523Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:09.0279891Z 2025-05-07T19:46:09.0279907Z 2025-05-07T19:46:09.0279911Z 2025-05-07T19:46:09.0279914Z 2025-05-07T19:46:09.0279917Z 2025-05-07T19:46:09.0279921Z 2025-05-07T19:46:09.0279924Z 2025-05-07T19:46:09.0279928Z 2025-05-07T19:46:09.0279931Z 2025-05-07T19:46:09.0279936Z 2025-05-07T19:46:09.0279939Z 2025-05-07T19:46:09.0279943Z 2025-05-07T19:46:09.0279946Z 2025-05-07T19:46:09.0279950Z 2025-05-07T19:46:09.0279953Z 2025-05-07T19:46:09.0279957Z 2025-05-07T19:46:09.0279977Z 2025-05-07T19:46:09.0280673Z cuda-nvvm-impl-12.6. 
| 7.7 MB | | 0%  2025-05-07T19:46:09.0281168Z 2025-05-07T19:46:09.0281172Z 2025-05-07T19:46:09.0281176Z 2025-05-07T19:46:09.0281180Z 2025-05-07T19:46:09.0281185Z 2025-05-07T19:46:09.0281205Z 2025-05-07T19:46:09.0281209Z 2025-05-07T19:46:09.0281212Z 2025-05-07T19:46:09.0281215Z 2025-05-07T19:46:09.0281219Z 2025-05-07T19:46:09.0281222Z 2025-05-07T19:46:09.0281225Z 2025-05-07T19:46:09.0281229Z 2025-05-07T19:46:09.0281232Z 2025-05-07T19:46:09.0281235Z 2025-05-07T19:46:09.0281240Z 2025-05-07T19:46:09.0281243Z 2025-05-07T19:46:09.0281246Z 2025-05-07T19:46:09.0284160Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:09.0284504Z 2025-05-07T19:46:09.0284524Z 2025-05-07T19:46:09.0284528Z 2025-05-07T19:46:09.0284532Z 2025-05-07T19:46:09.0284535Z 2025-05-07T19:46:09.0284538Z 2025-05-07T19:46:09.0284542Z 2025-05-07T19:46:09.0284545Z 2025-05-07T19:46:09.0284643Z 2025-05-07T19:46:09.0284653Z 2025-05-07T19:46:09.0284657Z 2025-05-07T19:46:09.0284660Z 2025-05-07T19:46:09.0284663Z 2025-05-07T19:46:09.0284666Z 2025-05-07T19:46:09.0284670Z 2025-05-07T19:46:09.0284674Z 2025-05-07T19:46:09.0284677Z 2025-05-07T19:46:09.0284698Z 2025-05-07T19:46:09.0284702Z 2025-05-07T19:46:09.1212178Z ... (more hidden) ... 2025-05-07T19:46:09.1212656Z nsight-compute-2024. 
| 443.1 MB | 1 | 1% 2025-05-07T19:46:09.1212939Z 2025-05-07T19:46:09.1234285Z libcublas-12.6.4.1 | 256.2 MB | | 1%  2025-05-07T19:46:09.1234600Z 2025-05-07T19:46:09.1234659Z 2025-05-07T19:46:09.1234663Z 2025-05-07T19:46:09.1267036Z libcusparse-12.5.4.2 | 118.6 MB | | 1%  2025-05-07T19:46:09.1267352Z 2025-05-07T19:46:09.1267357Z 2025-05-07T19:46:09.1423374Z libcufft-11.3.0.4 | 156.2 MB | | 1%  2025-05-07T19:46:09.1423674Z 2025-05-07T19:46:09.1423695Z 2025-05-07T19:46:09.1423708Z 2025-05-07T19:46:09.1423713Z 2025-05-07T19:46:09.2216614Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:09.2216936Z 2025-05-07T19:46:09.2236105Z libcublas-12.6.4.1 | 256.2 MB | 3 | 4%  2025-05-07T19:46:09.2236920Z 2025-05-07T19:46:09.2236934Z 2025-05-07T19:46:09.2236961Z 2025-05-07T19:46:09.2363221Z libcusparse-12.5.4.2 | 118.6 MB | 3 | 4%  2025-05-07T19:46:09.2363550Z 2025-05-07T19:46:09.2363643Z 2025-05-07T19:46:09.2472884Z libcufft-11.3.0.4 | 156.2 MB | 7 | 7%  2025-05-07T19:46:09.2473172Z 2025-05-07T19:46:09.2473177Z 2025-05-07T19:46:09.2473182Z 2025-05-07T19:46:09.2473185Z 2025-05-07T19:46:09.3002713Z cuda-nsight-12.6.77 | 113.2 MB | 2 | 3%  2025-05-07T19:46:09.3218072Z nsight-compute-2024. | 443.1 MB | 2 | 2% 2025-05-07T19:46:09.3218363Z 2025-05-07T19:46:09.3236650Z libcublas-12.6.4.1 | 256.2 MB | 8 | 8%  2025-05-07T19:46:09.3236952Z 2025-05-07T19:46:09.3237047Z 2025-05-07T19:46:09.3237056Z 2025-05-07T19:46:09.3476141Z libcusparse-12.5.4.2 | 118.6 MB | 8 | 8%  2025-05-07T19:46:09.3477007Z 2025-05-07T19:46:09.3477021Z 2025-05-07T19:46:09.3477032Z 2025-05-07T19:46:09.3477042Z 2025-05-07T19:46:09.3713801Z cuda-nsight-12.6.77 | 113.2 MB | 7 | 8%  2025-05-07T19:46:09.3714666Z 2025-05-07T19:46:09.4003393Z 2025-05-07T19:46:09.4004349Z libcufft-11.3.0.4 | 156.2 MB | #1 | 11%  2025-05-07T19:46:09.4238115Z nsight-compute-2024. 
| 443.1 MB | 3 | 4% 2025-05-07T19:46:09.4238676Z 2025-05-07T19:46:09.4238701Z 2025-05-07T19:46:09.4238705Z 2025-05-07T19:46:09.4477156Z libcusparse-12.5.4.2 | 118.6 MB | #3 | 14%  2025-05-07T19:46:09.4477458Z 2025-05-07T19:46:09.4477593Z 2025-05-07T19:46:09.4477601Z 2025-05-07T19:46:09.4477606Z 2025-05-07T19:46:09.4513792Z cuda-nsight-12.6.77 | 113.2 MB | #2 | 12%  2025-05-07T19:46:09.4514701Z 2025-05-07T19:46:09.4714040Z libcublas-12.6.4.1 | 256.2 MB | #1 | 12%  2025-05-07T19:46:09.4714343Z 2025-05-07T19:46:09.4714454Z 2025-05-07T19:46:09.5006759Z libcufft-11.3.0.4 | 156.2 MB | #5 | 15%  2025-05-07T19:46:09.5239474Z nsight-compute-2024. | 443.1 MB | 5 | 5% 2025-05-07T19:46:09.5239754Z 2025-05-07T19:46:09.5239955Z 2025-05-07T19:46:09.5240042Z 2025-05-07T19:46:09.5480355Z libcusparse-12.5.4.2 | 118.6 MB | #9 | 19%  2025-05-07T19:46:09.5480660Z 2025-05-07T19:46:09.5480665Z 2025-05-07T19:46:09.5480696Z 2025-05-07T19:46:09.5480700Z 2025-05-07T19:46:09.5786676Z cuda-nsight-12.6.77 | 113.2 MB | #6 | 17%  2025-05-07T19:46:09.5787394Z 2025-05-07T19:46:09.5864366Z libcublas-12.6.4.1 | 256.2 MB | #4 | 15%  2025-05-07T19:46:09.5865178Z 2025-05-07T19:46:09.5865191Z 2025-05-07T19:46:09.6009807Z libcufft-11.3.0.4 | 156.2 MB | #9 | 19%  2025-05-07T19:46:09.6241489Z nsight-compute-2024. | 443.1 MB | 6 | 7% 2025-05-07T19:46:09.6242334Z 2025-05-07T19:46:09.6242348Z 2025-05-07T19:46:09.6480016Z 2025-05-07T19:46:09.6480440Z libcusparse-12.5.4.2 | 118.6 MB | ##3 | 24%  2025-05-07T19:46:09.6480745Z 2025-05-07T19:46:09.6480872Z 2025-05-07T19:46:09.6480881Z 2025-05-07T19:46:09.6480885Z 2025-05-07T19:46:09.6865988Z cuda-nsight-12.6.77 | 113.2 MB | ## | 21%  2025-05-07T19:46:09.6866862Z 2025-05-07T19:46:09.6866887Z 2025-05-07T19:46:09.6924737Z libcufft-11.3.0.4 | 156.2 MB | ##3 | 23%  2025-05-07T19:46:09.6925032Z 2025-05-07T19:46:09.7009595Z libcublas-12.6.4.1 | 256.2 MB | #7 | 17%  2025-05-07T19:46:09.7244580Z nsight-compute-2024. 
| 443.1 MB | 7 | 8% 2025-05-07T19:46:09.7245361Z 2025-05-07T19:46:09.7245376Z 2025-05-07T19:46:09.7245419Z 2025-05-07T19:46:09.7483213Z libcusparse-12.5.4.2 | 118.6 MB | ##9 | 29%  2025-05-07T19:46:09.7484091Z 2025-05-07T19:46:09.7484137Z 2025-05-07T19:46:09.7484163Z 2025-05-07T19:46:09.7484174Z 2025-05-07T19:46:09.7866242Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 25%  2025-05-07T19:46:09.7867109Z 2025-05-07T19:46:09.7867140Z 2025-05-07T19:46:09.8017745Z libcufft-11.3.0.4 | 156.2 MB | ##6 | 27%  2025-05-07T19:46:09.8022577Z nsight-compute-2024. | 443.1 MB | 9 | 9% 2025-05-07T19:46:09.8023554Z 2025-05-07T19:46:09.8245775Z libcublas-12.6.4.1 | 256.2 MB | #9 | 20%  2025-05-07T19:46:09.8246059Z 2025-05-07T19:46:09.8246176Z 2025-05-07T19:46:09.8246179Z 2025-05-07T19:46:09.8485670Z libcusparse-12.5.4.2 | 118.6 MB | ###3 | 34%  2025-05-07T19:46:09.8486536Z 2025-05-07T19:46:09.8486550Z 2025-05-07T19:46:09.8486561Z 2025-05-07T19:46:09.8486571Z 2025-05-07T19:46:09.8869316Z cuda-nsight-12.6.77 | 113.2 MB | ##9 | 30%  2025-05-07T19:46:09.8869851Z 2025-05-07T19:46:09.8869889Z 2025-05-07T19:46:09.9094284Z libcufft-11.3.0.4 | 156.2 MB | ### | 31%  2025-05-07T19:46:09.9095112Z 2025-05-07T19:46:09.9128660Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 23%  2025-05-07T19:46:09.9246181Z nsight-compute-2024. | 443.1 MB | # | 11% 2025-05-07T19:46:09.9246577Z 2025-05-07T19:46:09.9246727Z 2025-05-07T19:46:09.9246736Z 2025-05-07T19:46:09.9522852Z libcusparse-12.5.4.2 | 118.6 MB | ###9 | 39%  2025-05-07T19:46:09.9523164Z 2025-05-07T19:46:09.9523169Z 2025-05-07T19:46:09.9523172Z 2025-05-07T19:46:09.9523176Z 2025-05-07T19:46:09.9875935Z cuda-nsight-12.6.77 | 113.2 MB | ###4 | 34%  2025-05-07T19:46:09.9876817Z 2025-05-07T19:46:09.9876832Z 2025-05-07T19:46:10.0129503Z libcufft-11.3.0.4 | 156.2 MB | ###4 | 35%  2025-05-07T19:46:10.0160943Z nsight-compute-2024. 
| 443.1 MB | #1 | 12% 2025-05-07T19:46:10.0161728Z 2025-05-07T19:46:10.0249388Z libcublas-12.6.4.1 | 256.2 MB | ##4 | 25%  2025-05-07T19:46:10.0249718Z 2025-05-07T19:46:10.0249936Z 2025-05-07T19:46:10.0249941Z 2025-05-07T19:46:10.0528233Z libcusparse-12.5.4.2 | 118.6 MB | ####4 | 44%  2025-05-07T19:46:10.0528536Z 2025-05-07T19:46:10.0528689Z 2025-05-07T19:46:10.0528693Z 2025-05-07T19:46:10.0528724Z 2025-05-07T19:46:10.0876316Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 39%  2025-05-07T19:46:10.0876627Z 2025-05-07T19:46:10.0876725Z 2025-05-07T19:46:10.1131391Z libcufft-11.3.0.4 | 156.2 MB | ###8 | 39%  2025-05-07T19:46:10.1178716Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:46:10.1179025Z 2025-05-07T19:46:10.1252313Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 27%  2025-05-07T19:46:10.1252599Z 2025-05-07T19:46:10.1252789Z 2025-05-07T19:46:10.1252826Z 2025-05-07T19:46:10.1619163Z libcusparse-12.5.4.2 | 118.6 MB | ####9 | 49%  2025-05-07T19:46:10.1619472Z 2025-05-07T19:46:10.1619476Z 2025-05-07T19:46:10.1619481Z 2025-05-07T19:46:10.1619484Z 2025-05-07T19:46:10.1882182Z cuda-nsight-12.6.77 | 113.2 MB | ####2 | 43%  2025-05-07T19:46:10.1882510Z 2025-05-07T19:46:10.1882515Z 2025-05-07T19:46:10.2131637Z libcufft-11.3.0.4 | 156.2 MB | ####2 | 43%  2025-05-07T19:46:10.2213754Z nsight-compute-2024. | 443.1 MB | #4 | 15% 2025-05-07T19:46:10.2214531Z 2025-05-07T19:46:10.2253151Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 30%  2025-05-07T19:46:10.2253952Z 2025-05-07T19:46:10.2253968Z 2025-05-07T19:46:10.2253979Z 2025-05-07T19:46:10.2624406Z libcusparse-12.5.4.2 | 118.6 MB | #####4 | 55%  2025-05-07T19:46:10.2625259Z 2025-05-07T19:46:10.2625272Z 2025-05-07T19:46:10.2625283Z 2025-05-07T19:46:10.2625307Z 2025-05-07T19:46:10.2885413Z cuda-nsight-12.6.77 | 113.2 MB | ####6 | 47%  2025-05-07T19:46:10.2886299Z 2025-05-07T19:46:10.2886312Z 2025-05-07T19:46:10.3132437Z libcufft-11.3.0.4 | 156.2 MB | ####6 | 47%  2025-05-07T19:46:10.3214474Z nsight-compute-2024. 
| 443.1 MB | #6 | 16% 2025-05-07T19:46:10.3215270Z 2025-05-07T19:46:10.3255584Z libcublas-12.6.4.1 | 256.2 MB | ###2 | 33%  2025-05-07T19:46:10.3256389Z 2025-05-07T19:46:10.3256403Z 2025-05-07T19:46:10.3256414Z 2025-05-07T19:46:10.3624723Z libcusparse-12.5.4.2 | 118.6 MB | ######1 | 61%  2025-05-07T19:46:10.3625046Z 2025-05-07T19:46:10.3625051Z 2025-05-07T19:46:10.3625055Z 2025-05-07T19:46:10.3625059Z 2025-05-07T19:46:10.3884715Z cuda-nsight-12.6.77 | 113.2 MB | #####4 | 54%  2025-05-07T19:46:10.3885043Z 2025-05-07T19:46:10.3885048Z 2025-05-07T19:46:10.4134406Z libcufft-11.3.0.4 | 156.2 MB | #####2 | 52%  2025-05-07T19:46:10.4256007Z nsight-compute-2024. | 443.1 MB | #8 | 18% 2025-05-07T19:46:10.4256370Z 2025-05-07T19:46:10.4256563Z 2025-05-07T19:46:10.4256569Z 2025-05-07T19:46:10.4260294Z libcusparse-12.5.4.2 | 118.6 MB | ######8 | 69%  2025-05-07T19:46:10.4260699Z 2025-05-07T19:46:10.4624895Z libcublas-12.6.4.1 | 256.2 MB | ###5 | 36%  2025-05-07T19:46:10.4625197Z 2025-05-07T19:46:10.4625202Z 2025-05-07T19:46:10.4625206Z 2025-05-07T19:46:10.4625209Z 2025-05-07T19:46:10.4885724Z cuda-nsight-12.6.77 | 113.2 MB | #####9 | 60%  2025-05-07T19:46:10.4886599Z 2025-05-07T19:46:10.5143202Z 2025-05-07T19:46:10.5143586Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 59%  2025-05-07T19:46:10.5262429Z nsight-compute-2024. | 443.1 MB | #9 | 20% 2025-05-07T19:46:10.5263217Z 2025-05-07T19:46:10.5490139Z libcublas-12.6.4.1 | 256.2 MB | ###8 | 39%  2025-05-07T19:46:10.5490421Z 2025-05-07T19:46:10.5490426Z 2025-05-07T19:46:10.5490430Z 2025-05-07T19:46:10.5625523Z libcusparse-12.5.4.2 | 118.6 MB | #######4 | 75%  2025-05-07T19:46:10.5625830Z 2025-05-07T19:46:10.5625837Z 2025-05-07T19:46:10.5625841Z 2025-05-07T19:46:10.5625860Z 2025-05-07T19:46:10.5996511Z cuda-nsight-12.6.77 | 113.2 MB | ######8 | 69%  2025-05-07T19:46:10.5996835Z 2025-05-07T19:46:10.5997067Z 2025-05-07T19:46:10.6144647Z libcufft-11.3.0.4 | 156.2 MB | ######4 | 64%  2025-05-07T19:46:10.6538463Z nsight-compute-2024. 
| 443.1 MB | ##1 | 22% 2025-05-07T19:46:10.6538759Z 2025-05-07T19:46:10.6626852Z libcublas-12.6.4.1 | 256.2 MB | ####1 | 42%  2025-05-07T19:46:10.6627141Z 2025-05-07T19:46:10.6627145Z 2025-05-07T19:46:10.6627149Z 2025-05-07T19:46:10.6627153Z 2025-05-07T19:46:10.7264139Z cuda-nsight-12.6.77 | 113.2 MB | #######7 | 77%  2025-05-07T19:46:10.7264446Z 2025-05-07T19:46:10.7264451Z 2025-05-07T19:46:10.7264573Z 2025-05-07T19:46:10.7491449Z libcusparse-12.5.4.2 | 118.6 MB | ######## | 81%  2025-05-07T19:46:10.7491756Z 2025-05-07T19:46:10.7491761Z 2025-05-07T19:46:10.7628591Z libcufft-11.3.0.4 | 156.2 MB | ######9 | 69%  2025-05-07T19:46:10.7774479Z nsight-compute-2024. | 443.1 MB | ##3 | 23% 2025-05-07T19:46:10.7774993Z 2025-05-07T19:46:10.7928694Z libcublas-12.6.4.1 | 256.2 MB | ####4 | 44%  2025-05-07T19:46:10.7929015Z 2025-05-07T19:46:10.7929019Z 2025-05-07T19:46:10.7929023Z 2025-05-07T19:46:10.7929038Z 2025-05-07T19:46:10.8279701Z cuda-nsight-12.6.77 | 113.2 MB | ########4 | 84%  2025-05-07T19:46:10.8280586Z 2025-05-07T19:46:10.8280600Z 2025-05-07T19:46:10.8280611Z 2025-05-07T19:46:10.8621782Z libcusparse-12.5.4.2 | 118.6 MB | ########5 | 86%  2025-05-07T19:46:10.8622091Z 2025-05-07T19:46:10.8622096Z 2025-05-07T19:46:10.8665315Z libcufft-11.3.0.4 | 156.2 MB | #######3 | 74%  2025-05-07T19:46:10.8988029Z nsight-compute-2024. | 443.1 MB | ##4 | 25% 2025-05-07T19:46:10.8988355Z 2025-05-07T19:46:10.9395363Z libcublas-12.6.4.1 | 256.2 MB | ####6 | 47%  2025-05-07T19:46:10.9395650Z 2025-05-07T19:46:10.9395785Z 2025-05-07T19:46:10.9395790Z 2025-05-07T19:46:10.9635531Z libcusparse-12.5.4.2 | 118.6 MB | ######### | 91%  2025-05-07T19:46:10.9636430Z 2025-05-07T19:46:10.9636462Z 2025-05-07T19:46:10.9664510Z libcufft-11.3.0.4 | 156.2 MB | #######8 | 78%  2025-05-07T19:46:10.9795419Z nsight-compute-2024. 
| 443.1 MB | ##6 | 26% 2025-05-07T19:46:10.9796262Z 2025-05-07T19:46:10.9796267Z 2025-05-07T19:46:10.9796271Z 2025-05-07T19:46:10.9796274Z 2025-05-07T19:46:11.0049700Z cuda-nsight-12.6.77 | 113.2 MB | ######### | 91%  2025-05-07T19:46:11.0050022Z 2025-05-07T19:46:11.0636121Z libcublas-12.6.4.1 | 256.2 MB | ####9 | 49%  2025-05-07T19:46:11.0636951Z 2025-05-07T19:46:11.0636965Z 2025-05-07T19:46:11.0666134Z libcufft-11.3.0.4 | 156.2 MB | ########2 | 83%  2025-05-07T19:46:11.1049048Z nsight-compute-2024. | 443.1 MB | ##8 | 28% 2025-05-07T19:46:11.1050025Z 2025-05-07T19:46:11.1050030Z 2025-05-07T19:46:11.1050034Z 2025-05-07T19:46:11.1050038Z 2025-05-07T19:46:11.1050359Z cuda-nsight-12.6.77 | 113.2 MB | #########6 | 96%  2025-05-07T19:46:11.1050986Z 2025-05-07T19:46:11.1635605Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 52%  2025-05-07T19:46:11.1635915Z 2025-05-07T19:46:11.1636121Z 2025-05-07T19:46:11.1665825Z libcufft-11.3.0.4 | 156.2 MB | ########7 | 87%  2025-05-07T19:46:11.1826496Z nsight-compute-2024. | 443.1 MB | ##9 | 30% 2025-05-07T19:46:11.1826944Z 2025-05-07T19:46:11.1826950Z 2025-05-07T19:46:11.1826953Z 2025-05-07T19:46:11.2118249Z libcusparse-12.5.4.2 | 118.6 MB | #########5 | 96%  2025-05-07T19:46:11.2118579Z 2025-05-07T19:46:11.2635879Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 54%  2025-05-07T19:46:11.2636179Z 2025-05-07T19:46:11.2636293Z 2025-05-07T19:46:11.2666380Z libcufft-11.3.0.4 | 156.2 MB | #########2 | 92%  2025-05-07T19:46:11.3119790Z nsight-compute-2024. | 443.1 MB | ###1 | 32% 2025-05-07T19:46:11.3120215Z 2025-05-07T19:46:11.3667961Z libcublas-12.6.4.1 | 256.2 MB | #####7 | 57%  2025-05-07T19:46:11.3691496Z nsight-compute-2024. | 443.1 MB | ###3 | 34% 2025-05-07T19:46:11.3692038Z 2025-05-07T19:46:11.3692044Z 2025-05-07T19:46:11.4120225Z libcufft-11.3.0.4 | 156.2 MB | #########7 | 98%  2025-05-07T19:46:11.4120510Z 2025-05-07T19:46:11.4668565Z libcublas-12.6.4.1 | 256.2 MB | ###### | 61%  2025-05-07T19:46:11.5120680Z nsight-compute-2024. 
| 443.1 MB | ###6 | 36% 2025-05-07T19:46:11.5121091Z 2025-05-07T19:46:11.5670862Z libcublas-12.6.4.1 | 256.2 MB | ######4 | 65%  2025-05-07T19:46:11.6184775Z nsight-compute-2024. | 443.1 MB | ###8 | 38% 2025-05-07T19:46:11.6185064Z 2025-05-07T19:46:11.6671375Z libcublas-12.6.4.1 | 256.2 MB | ######9 | 70%  2025-05-07T19:46:11.7294433Z nsight-compute-2024. | 443.1 MB | ####1 | 41% 2025-05-07T19:46:11.7294732Z 2025-05-07T19:46:11.8170596Z libcublas-12.6.4.1 | 256.2 MB | #######3 | 74%  2025-05-07T19:46:11.8295128Z nsight-compute-2024. | 443.1 MB | ####3 | 43% 2025-05-07T19:46:11.8295428Z 2025-05-07T19:46:11.9171583Z libcublas-12.6.4.1 | 256.2 MB | #######9 | 79%  2025-05-07T19:46:11.9299579Z nsight-compute-2024. | 443.1 MB | ####6 | 46% 2025-05-07T19:46:11.9299874Z 2025-05-07T19:46:12.0229157Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 84%  2025-05-07T19:46:12.0256066Z nsight-compute-2024. | 443.1 MB | ####8 | 48% 2025-05-07T19:46:12.0257096Z 2025-05-07T19:46:12.0257130Z 2025-05-07T19:46:12.0257141Z 2025-05-07T19:46:12.0257152Z 2025-05-07T19:46:12.0366095Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:12.0366403Z 2025-05-07T19:46:12.0707207Z libcublas-12.6.4.1 | 256.2 MB | ########7 | 88%  2025-05-07T19:46:12.0708022Z 2025-05-07T19:46:12.0708063Z 2025-05-07T19:46:12.0708074Z 2025-05-07T19:46:12.0708783Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:12.0709233Z 2025-05-07T19:46:12.0709237Z 2025-05-07T19:46:12.0709254Z 2025-05-07T19:46:12.0904911Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:12.0905226Z 2025-05-07T19:46:12.0905240Z 2025-05-07T19:46:12.0905243Z 2025-05-07T19:46:12.0905247Z 2025-05-07T19:46:12.0905250Z 2025-05-07T19:46:12.1140660Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:12.1140974Z 2025-05-07T19:46:12.1140978Z 2025-05-07T19:46:12.1140982Z 2025-05-07T19:46:12.1140986Z 2025-05-07T19:46:12.1140989Z 2025-05-07T19:46:12.1140993Z 2025-05-07T19:46:12.1674137Z 
libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:12.1908111Z nsight-compute-2024. | 443.1 MB | ##### | 51% 2025-05-07T19:46:12.1908429Z 2025-05-07T19:46:12.1908434Z 2025-05-07T19:46:12.1908438Z 2025-05-07T19:46:12.1908442Z 2025-05-07T19:46:12.1908445Z 2025-05-07T19:46:12.2140443Z cuda-nvvp-12.6.80 | 109.3 MB | 5 | 5%  2025-05-07T19:46:12.2140760Z 2025-05-07T19:46:12.2140778Z 2025-05-07T19:46:12.2140782Z 2025-05-07T19:46:12.2140785Z 2025-05-07T19:46:12.2140789Z 2025-05-07T19:46:12.2140807Z 2025-05-07T19:46:12.2879771Z libcusolver-11.7.1.2 | 95.8 MB | 7 | 7%  2025-05-07T19:46:12.2880098Z 2025-05-07T19:46:12.2909700Z libcublas-12.6.4.1 | 256.2 MB | #########2 | 92%  2025-05-07T19:46:12.2910491Z 2025-05-07T19:46:12.2910518Z 2025-05-07T19:46:12.2910529Z 2025-05-07T19:46:12.2910540Z 2025-05-07T19:46:12.2910550Z 2025-05-07T19:46:12.3114452Z cuda-nvvp-12.6.80 | 109.3 MB | # | 11%  2025-05-07T19:46:12.3142760Z nsight-compute-2024. | 443.1 MB | #####2 | 53% 2025-05-07T19:46:12.3143075Z 2025-05-07T19:46:12.3143080Z 2025-05-07T19:46:12.3143084Z 2025-05-07T19:46:12.3143088Z 2025-05-07T19:46:12.3143091Z 2025-05-07T19:46:12.3143095Z 2025-05-07T19:46:12.3910953Z libcusolver-11.7.1.2 | 95.8 MB | #3 | 13%  2025-05-07T19:46:12.3911285Z 2025-05-07T19:46:12.3911293Z 2025-05-07T19:46:12.3911311Z 2025-05-07T19:46:12.3911315Z 2025-05-07T19:46:12.3911318Z 2025-05-07T19:46:12.4144404Z cuda-nvvp-12.6.80 | 109.3 MB | #6 | 16%  2025-05-07T19:46:12.4145730Z 2025-05-07T19:46:12.4145744Z 2025-05-07T19:46:12.4145772Z 2025-05-07T19:46:12.4145782Z 2025-05-07T19:46:12.4145793Z 2025-05-07T19:46:12.4145804Z 2025-05-07T19:46:12.4417912Z libcusolver-11.7.1.2 | 95.8 MB | #9 | 20%  2025-05-07T19:46:12.4568755Z nsight-compute-2024. 
| 443.1 MB | #####4 | 55% 2025-05-07T19:46:12.4569134Z 2025-05-07T19:46:12.5144657Z libcublas-12.6.4.1 | 256.2 MB | #########5 | 96%  2025-05-07T19:46:12.5144964Z 2025-05-07T19:46:12.5144969Z 2025-05-07T19:46:12.5144973Z 2025-05-07T19:46:12.5144977Z 2025-05-07T19:46:12.5144981Z 2025-05-07T19:46:12.5144985Z 2025-05-07T19:46:12.5417883Z libcusolver-11.7.1.2 | 95.8 MB | ##6 | 27%  2025-05-07T19:46:12.5488822Z nsight-compute-2024. | 443.1 MB | #####6 | 56% 2025-05-07T19:46:12.5489362Z 2025-05-07T19:46:12.5489382Z 2025-05-07T19:46:12.5489388Z 2025-05-07T19:46:12.5489661Z 2025-05-07T19:46:12.5489678Z 2025-05-07T19:46:12.5916622Z cuda-nvvp-12.6.80 | 109.3 MB | ## | 21%  2025-05-07T19:46:12.5916948Z 2025-05-07T19:46:12.6147933Z libcublas-12.6.4.1 | 256.2 MB | #########8 | 99%  2025-05-07T19:46:12.6148729Z 2025-05-07T19:46:12.6148767Z 2025-05-07T19:46:12.6148779Z 2025-05-07T19:46:12.6148790Z 2025-05-07T19:46:12.6148800Z 2025-05-07T19:46:12.6148811Z 2025-05-07T19:46:12.6493878Z libcusolver-11.7.1.2 | 95.8 MB | ###3 | 33%  2025-05-07T19:46:12.6494781Z 2025-05-07T19:46:12.6603363Z 2025-05-07T19:46:12.6603382Z 2025-05-07T19:46:12.6603397Z 2025-05-07T19:46:12.6603411Z 2025-05-07T19:46:12.6604482Z cuda-nvvp-12.6.80 | 109.3 MB | ##5 | 26%  2025-05-07T19:46:12.7147788Z nsight-compute-2024. | 443.1 MB | #####7 | 58% 2025-05-07T19:46:12.7148117Z 2025-05-07T19:46:12.7148121Z 2025-05-07T19:46:12.7148125Z 2025-05-07T19:46:12.7148128Z 2025-05-07T19:46:12.7148132Z 2025-05-07T19:46:12.7148151Z 2025-05-07T19:46:12.7495742Z libcusolver-11.7.1.2 | 95.8 MB | ####1 | 41%  2025-05-07T19:46:12.7496890Z 2025-05-07T19:46:12.7496904Z 2025-05-07T19:46:12.7496914Z 2025-05-07T19:46:12.7496925Z 2025-05-07T19:46:12.7496936Z 2025-05-07T19:46:12.7603759Z cuda-nvvp-12.6.80 | 109.3 MB | ###3 | 33%  2025-05-07T19:46:12.8148796Z nsight-compute-2024. 
| 443.1 MB | #####9 | 60% 2025-05-07T19:46:12.8149095Z 2025-05-07T19:46:12.8149117Z 2025-05-07T19:46:12.8149122Z 2025-05-07T19:46:12.8149126Z 2025-05-07T19:46:12.8149130Z 2025-05-07T19:46:12.8149135Z 2025-05-07T19:46:12.8496086Z libcusolver-11.7.1.2 | 95.8 MB | ##### | 51%  2025-05-07T19:46:12.8496420Z 2025-05-07T19:46:12.8496424Z 2025-05-07T19:46:12.8496428Z 2025-05-07T19:46:12.8496431Z 2025-05-07T19:46:12.8496435Z 2025-05-07T19:46:12.8605156Z cuda-nvvp-12.6.80 | 109.3 MB | #### | 40%  2025-05-07T19:46:12.9149197Z nsight-compute-2024. | 443.1 MB | ######1 | 62% 2025-05-07T19:46:12.9149726Z 2025-05-07T19:46:12.9149973Z 2025-05-07T19:46:12.9149978Z 2025-05-07T19:46:12.9149981Z 2025-05-07T19:46:12.9150014Z 2025-05-07T19:46:12.9150078Z 2025-05-07T19:46:12.9496331Z libcusolver-11.7.1.2 | 95.8 MB | #####9 | 60%  2025-05-07T19:46:12.9496762Z 2025-05-07T19:46:12.9496767Z 2025-05-07T19:46:12.9496770Z 2025-05-07T19:46:12.9496774Z 2025-05-07T19:46:12.9496777Z 2025-05-07T19:46:12.9607205Z cuda-nvvp-12.6.80 | 109.3 MB | ####7 | 47%  2025-05-07T19:46:12.9764262Z nsight-compute-2024. 
| 443.1 MB | ######3 | 64% 2025-05-07T19:46:12.9764778Z 2025-05-07T19:46:12.9764921Z 2025-05-07T19:46:13.0153033Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:13.0153325Z 2025-05-07T19:46:13.0153477Z 2025-05-07T19:46:13.0153485Z 2025-05-07T19:46:13.0153490Z 2025-05-07T19:46:13.0153494Z 2025-05-07T19:46:13.0153498Z 2025-05-07T19:46:13.0529901Z libcusolver-11.7.1.2 | 95.8 MB | ######9 | 70%  2025-05-07T19:46:13.0530438Z 2025-05-07T19:46:13.0530443Z 2025-05-07T19:46:13.0530447Z 2025-05-07T19:46:13.0530450Z 2025-05-07T19:46:13.0530454Z 2025-05-07T19:46:13.0552073Z cuda-nvvp-12.6.80 | 109.3 MB | #####3 | 54%  2025-05-07T19:46:13.0552410Z 2025-05-07T19:46:13.0552415Z 2025-05-07T19:46:13.0552418Z 2025-05-07T19:46:13.0552422Z 2025-05-07T19:46:13.0552425Z 2025-05-07T19:46:13.0552429Z 2025-05-07T19:46:13.0552432Z 2025-05-07T19:46:13.1153370Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:13.1153686Z 2025-05-07T19:46:13.1153775Z 2025-05-07T19:46:13.1153784Z 2025-05-07T19:46:13.1153790Z 2025-05-07T19:46:13.1153794Z 2025-05-07T19:46:13.1153798Z 2025-05-07T19:46:13.1288489Z libcusolver-11.7.1.2 | 95.8 MB | #######9 | 79%  2025-05-07T19:46:13.1288967Z nsight-compute-2024. 
| 443.1 MB | ######5 | 65% 2025-05-07T19:46:13.1289217Z 2025-05-07T19:46:13.1289431Z 2025-05-07T19:46:13.1289443Z 2025-05-07T19:46:13.1289447Z 2025-05-07T19:46:13.1539666Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:13.1539984Z 2025-05-07T19:46:13.1539988Z 2025-05-07T19:46:13.1539992Z 2025-05-07T19:46:13.1539996Z 2025-05-07T19:46:13.1540000Z 2025-05-07T19:46:13.1561305Z cuda-nvvp-12.6.80 | 109.3 MB | ###### | 60%  2025-05-07T19:46:13.1561620Z 2025-05-07T19:46:13.1561625Z 2025-05-07T19:46:13.1561629Z 2025-05-07T19:46:13.1561632Z 2025-05-07T19:46:13.1561635Z 2025-05-07T19:46:13.1561639Z 2025-05-07T19:46:13.1561642Z 2025-05-07T19:46:13.2156196Z libnpp-12.3.1.54 | 93.4 MB | 3 | 3%  2025-05-07T19:46:13.2156508Z 2025-05-07T19:46:13.2156604Z 2025-05-07T19:46:13.2156609Z 2025-05-07T19:46:13.2156615Z 2025-05-07T19:46:13.2156625Z 2025-05-07T19:46:13.2156629Z 2025-05-07T19:46:13.2290230Z libcusolver-11.7.1.2 | 95.8 MB | ########9 | 89%  2025-05-07T19:46:13.2748008Z nsight-compute-2024. | 443.1 MB | ######7 | 67% 2025-05-07T19:46:13.2748303Z 2025-05-07T19:46:13.2748321Z 2025-05-07T19:46:13.2748325Z 2025-05-07T19:46:13.2748328Z 2025-05-07T19:46:13.2748332Z 2025-05-07T19:46:13.3015292Z cuda-nvvp-12.6.80 | 109.3 MB | ######6 | 66%  2025-05-07T19:46:13.3015617Z 2025-05-07T19:46:13.3015854Z 2025-05-07T19:46:13.3015858Z 2025-05-07T19:46:13.3015967Z 2025-05-07T19:46:13.3015971Z 2025-05-07T19:46:13.3015975Z 2025-05-07T19:46:13.3597808Z 2025-05-07T19:46:13.3598675Z libnpp-12.3.1.54 | 93.4 MB | 5 | 5%  2025-05-07T19:46:13.3752119Z nsight-compute-2024. 
| 443.1 MB | ######8 | 69% 2025-05-07T19:46:13.3752496Z 2025-05-07T19:46:13.3752567Z 2025-05-07T19:46:13.3752571Z 2025-05-07T19:46:13.3752595Z 2025-05-07T19:46:13.3752692Z 2025-05-07T19:46:13.4017127Z cuda-nvvp-12.6.80 | 109.3 MB | #######2 | 73%  2025-05-07T19:46:13.4017495Z 2025-05-07T19:46:13.4017521Z 2025-05-07T19:46:13.4017533Z 2025-05-07T19:46:13.4017537Z 2025-05-07T19:46:13.4017541Z 2025-05-07T19:46:13.4017545Z 2025-05-07T19:46:13.4017548Z 2025-05-07T19:46:13.4597383Z libnpp-12.3.1.54 | 93.4 MB | #3 | 13%  2025-05-07T19:46:13.5018000Z nsight-compute-2024. | 443.1 MB | ####### | 71% 2025-05-07T19:46:13.5018308Z 2025-05-07T19:46:13.5018312Z 2025-05-07T19:46:13.5018317Z 2025-05-07T19:46:13.5018336Z 2025-05-07T19:46:13.5018340Z 2025-05-07T19:46:13.5018344Z 2025-05-07T19:46:13.5018348Z 2025-05-07T19:46:13.5600778Z libnpp-12.3.1.54 | 93.4 MB | ##1 | 22%  2025-05-07T19:46:13.5911389Z nsight-compute-2024. | 443.1 MB | #######2 | 73% 2025-05-07T19:46:13.5911940Z 2025-05-07T19:46:13.5911951Z 2025-05-07T19:46:13.5912006Z 2025-05-07T19:46:13.5912014Z 2025-05-07T19:46:13.5912045Z 2025-05-07T19:46:13.6018447Z cuda-nvvp-12.6.80 | 109.3 MB | #######8 | 79%  2025-05-07T19:46:13.6018854Z 2025-05-07T19:46:13.6018889Z 2025-05-07T19:46:13.6019116Z 2025-05-07T19:46:13.6019120Z 2025-05-07T19:46:13.6019124Z 2025-05-07T19:46:13.6019127Z 2025-05-07T19:46:13.6019131Z 2025-05-07T19:46:13.6696327Z libnpp-12.3.1.54 | 93.4 MB | ##9 | 29%  2025-05-07T19:46:13.6912738Z nsight-compute-2024. 
| 443.1 MB | #######4 | 75% 2025-05-07T19:46:13.6913219Z 2025-05-07T19:46:13.6913226Z 2025-05-07T19:46:13.6913232Z 2025-05-07T19:46:13.6913236Z 2025-05-07T19:46:13.6913240Z 2025-05-07T19:46:13.7018315Z cuda-nvvp-12.6.80 | 109.3 MB | ########5 | 85%  2025-05-07T19:46:13.7018751Z 2025-05-07T19:46:13.7018756Z 2025-05-07T19:46:13.7018759Z 2025-05-07T19:46:13.7018776Z 2025-05-07T19:46:13.7018780Z 2025-05-07T19:46:13.7018783Z 2025-05-07T19:46:13.7018787Z 2025-05-07T19:46:13.7809426Z libnpp-12.3.1.54 | 93.4 MB | ###5 | 36%  2025-05-07T19:46:13.7914197Z nsight-compute-2024. | 443.1 MB | #######6 | 76% 2025-05-07T19:46:13.7915002Z 2025-05-07T19:46:13.7915019Z 2025-05-07T19:46:13.7915023Z 2025-05-07T19:46:13.7915026Z 2025-05-07T19:46:13.7915030Z 2025-05-07T19:46:13.8019047Z cuda-nvvp-12.6.80 | 109.3 MB | #########1 | 92%  2025-05-07T19:46:13.8019372Z 2025-05-07T19:46:13.8019377Z 2025-05-07T19:46:13.8019380Z 2025-05-07T19:46:13.8019384Z 2025-05-07T19:46:13.8019387Z 2025-05-07T19:46:13.8019391Z 2025-05-07T19:46:13.8019394Z 2025-05-07T19:46:13.8853914Z libnpp-12.3.1.54 | 93.4 MB | ####2 | 43%  2025-05-07T19:46:13.8920538Z nsight-compute-2024. | 443.1 MB | #######8 | 78% 2025-05-07T19:46:13.8921049Z 2025-05-07T19:46:13.8921109Z 2025-05-07T19:46:13.8921115Z 2025-05-07T19:46:13.8921119Z 2025-05-07T19:46:13.8921122Z 2025-05-07T19:46:13.9027997Z cuda-nvvp-12.6.80 | 109.3 MB | #########8 | 98%  2025-05-07T19:46:13.9028864Z 2025-05-07T19:46:13.9028878Z 2025-05-07T19:46:13.9028890Z 2025-05-07T19:46:13.9028901Z 2025-05-07T19:46:13.9028942Z 2025-05-07T19:46:13.9028968Z 2025-05-07T19:46:13.9029053Z 2025-05-07T19:46:13.9917232Z libnpp-12.3.1.54 | 93.4 MB | ####9 | 49%  2025-05-07T19:46:14.0028248Z nsight-compute-2024. 
| 443.1 MB | #######9 | 80% 2025-05-07T19:46:14.0028536Z 2025-05-07T19:46:14.0028556Z 2025-05-07T19:46:14.0028560Z 2025-05-07T19:46:14.0028564Z 2025-05-07T19:46:14.0028568Z 2025-05-07T19:46:14.0028571Z 2025-05-07T19:46:14.0028575Z 2025-05-07T19:46:14.1030970Z libnpp-12.3.1.54 | 93.4 MB | #####8 | 58%  2025-05-07T19:46:14.1031293Z 2025-05-07T19:46:14.1031298Z 2025-05-07T19:46:14.1031316Z 2025-05-07T19:46:14.1031319Z 2025-05-07T19:46:14.1031323Z 2025-05-07T19:46:14.1031326Z 2025-05-07T19:46:14.1031330Z 2025-05-07T19:46:14.1178753Z libnpp-12.3.1.54 | 93.4 MB | ######9 | 70%  2025-05-07T19:46:14.2080859Z nsight-compute-2024. | 443.1 MB | ########1 | 82% 2025-05-07T19:46:14.2081444Z 2025-05-07T19:46:14.2081451Z 2025-05-07T19:46:14.2081492Z 2025-05-07T19:46:14.2081516Z 2025-05-07T19:46:14.2081520Z 2025-05-07T19:46:14.2081523Z 2025-05-07T19:46:14.2081527Z 2025-05-07T19:46:14.2178953Z libnpp-12.3.1.54 | 93.4 MB | #######8 | 79%  2025-05-07T19:46:14.3137340Z nsight-compute-2024. | 443.1 MB | ########3 | 84% 2025-05-07T19:46:14.3137934Z 2025-05-07T19:46:14.3137943Z 2025-05-07T19:46:14.3137948Z 2025-05-07T19:46:14.3137953Z 2025-05-07T19:46:14.3137957Z 2025-05-07T19:46:14.3137962Z 2025-05-07T19:46:14.3137967Z 2025-05-07T19:46:14.3181738Z libnpp-12.3.1.54 | 93.4 MB | ########7 | 87%  2025-05-07T19:46:14.4138331Z nsight-compute-2024. | 443.1 MB | ########6 | 86% 2025-05-07T19:46:14.4138864Z 2025-05-07T19:46:14.4138871Z 2025-05-07T19:46:14.4138876Z 2025-05-07T19:46:14.4138884Z 2025-05-07T19:46:14.4138890Z 2025-05-07T19:46:14.4138894Z 2025-05-07T19:46:14.4138897Z 2025-05-07T19:46:14.4184470Z libnpp-12.3.1.54 | 93.4 MB | #########5 | 96%  2025-05-07T19:46:14.5193647Z nsight-compute-2024. | 443.1 MB | ########8 | 88% 2025-05-07T19:46:14.5539748Z nsight-compute-2024. 
| 443.1 MB | ######### | 91% 2025-05-07T19:46:14.5540300Z 2025-05-07T19:46:14.5540308Z 2025-05-07T19:46:14.5540314Z 2025-05-07T19:46:14.5540319Z 2025-05-07T19:46:14.5540325Z 2025-05-07T19:46:14.5540330Z 2025-05-07T19:46:14.5540678Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:14.5540974Z 2025-05-07T19:46:14.5540978Z 2025-05-07T19:46:14.5540982Z 2025-05-07T19:46:14.5540985Z 2025-05-07T19:46:14.5541003Z 2025-05-07T19:46:14.5541006Z 2025-05-07T19:46:14.5899993Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:14.5900610Z 2025-05-07T19:46:14.5900617Z 2025-05-07T19:46:14.5900622Z 2025-05-07T19:46:14.5900628Z 2025-05-07T19:46:14.5900634Z 2025-05-07T19:46:14.5900639Z 2025-05-07T19:46:14.5900659Z 2025-05-07T19:46:14.5900663Z 2025-05-07T19:46:14.6804299Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:14.6919052Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:46:14.6919603Z 2025-05-07T19:46:14.6919660Z 2025-05-07T19:46:14.6919665Z 2025-05-07T19:46:14.6919670Z 2025-05-07T19:46:14.6919676Z 2025-05-07T19:46:14.6919682Z 2025-05-07T19:46:14.6919687Z 2025-05-07T19:46:14.6919693Z 2025-05-07T19:46:14.7876650Z cuda-nvdisasm-12.6.7 | 47.6 MB | #4 | 14%  2025-05-07T19:46:14.7919459Z nsight-compute-2024. | 443.1 MB | #########4 | 95% 2025-05-07T19:46:14.7919955Z 2025-05-07T19:46:14.7920183Z 2025-05-07T19:46:14.7920254Z 2025-05-07T19:46:14.7920258Z 2025-05-07T19:46:14.7920278Z 2025-05-07T19:46:14.7920359Z 2025-05-07T19:46:14.7920363Z 2025-05-07T19:46:14.7920713Z 2025-05-07T19:46:14.8875865Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##9 | 30%  2025-05-07T19:46:14.8920884Z nsight-compute-2024. 
| 443.1 MB | #########6 | 97% 2025-05-07T19:46:14.8921193Z 2025-05-07T19:46:14.8921206Z 2025-05-07T19:46:14.8921210Z 2025-05-07T19:46:14.8921214Z 2025-05-07T19:46:14.8921217Z 2025-05-07T19:46:14.8921221Z 2025-05-07T19:46:14.8921224Z 2025-05-07T19:46:14.8921228Z 2025-05-07T19:46:14.9917585Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####4 | 45%  2025-05-07T19:46:14.9919034Z nsight-compute-2024. | 443.1 MB | #########8 | 99% 2025-05-07T19:46:14.9919298Z 2025-05-07T19:46:14.9919319Z 2025-05-07T19:46:14.9919323Z 2025-05-07T19:46:14.9919326Z 2025-05-07T19:46:14.9919330Z 2025-05-07T19:46:14.9919333Z 2025-05-07T19:46:14.9919337Z 2025-05-07T19:46:14.9919340Z 2025-05-07T19:46:15.0925651Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###### | 60%  2025-05-07T19:46:15.0925980Z 2025-05-07T19:46:15.0926002Z 2025-05-07T19:46:15.0926049Z 2025-05-07T19:46:15.0926052Z 2025-05-07T19:46:15.0926061Z 2025-05-07T19:46:15.0926084Z 2025-05-07T19:46:15.0926089Z 2025-05-07T19:46:15.0926092Z 2025-05-07T19:46:15.1805563Z cuda-nvdisasm-12.6.7 | 47.6 MB | ######## | 81%  2025-05-07T19:46:15.1805953Z 2025-05-07T19:46:15.1805961Z 2025-05-07T19:46:15.1805966Z 2025-05-07T19:46:15.2651282Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:15.2651603Z 2025-05-07T19:46:15.2651624Z 2025-05-07T19:46:15.2651629Z 2025-05-07T19:46:15.2651634Z 2025-05-07T19:46:15.2651639Z 2025-05-07T19:46:15.3063279Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:15.3063615Z 2025-05-07T19:46:15.3063619Z 2025-05-07T19:46:15.3063623Z 2025-05-07T19:46:15.3063627Z 2025-05-07T19:46:15.3063643Z 2025-05-07T19:46:15.3063647Z 2025-05-07T19:46:15.3063650Z 2025-05-07T19:46:15.3063653Z 2025-05-07T19:46:15.3063657Z 2025-05-07T19:46:15.4063856Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:15.4064188Z 2025-05-07T19:46:15.4064192Z 2025-05-07T19:46:15.4064196Z 2025-05-07T19:46:15.4064213Z 2025-05-07T19:46:15.4064229Z 2025-05-07T19:46:15.4064446Z 2025-05-07T19:46:15.4064451Z 2025-05-07T19:46:15.4064454Z 
2025-05-07T19:46:15.4064457Z 2025-05-07T19:46:15.5067343Z libcurand-10.3.7.77 | 39.9 MB | #4 | 15%  2025-05-07T19:46:15.5068952Z 2025-05-07T19:46:15.5068966Z 2025-05-07T19:46:15.5068977Z 2025-05-07T19:46:15.5068988Z 2025-05-07T19:46:15.5068998Z 2025-05-07T19:46:15.5069008Z 2025-05-07T19:46:15.5069019Z 2025-05-07T19:46:15.5069029Z 2025-05-07T19:46:15.5069039Z 2025-05-07T19:46:15.6066279Z libcurand-10.3.7.77 | 39.9 MB | ###5 | 35%  2025-05-07T19:46:15.6066872Z 2025-05-07T19:46:15.6066889Z 2025-05-07T19:46:15.6066892Z 2025-05-07T19:46:15.6066896Z 2025-05-07T19:46:15.6066899Z 2025-05-07T19:46:15.6066903Z 2025-05-07T19:46:15.6066906Z 2025-05-07T19:46:15.6066910Z 2025-05-07T19:46:15.6066913Z 2025-05-07T19:46:15.6321581Z libcurand-10.3.7.77 | 39.9 MB | ######2 | 62%  2025-05-07T19:46:15.6322300Z 2025-05-07T19:46:15.6322312Z 2025-05-07T19:46:15.6322316Z 2025-05-07T19:46:15.6322320Z 2025-05-07T19:46:15.6322323Z 2025-05-07T19:46:15.6322326Z 2025-05-07T19:46:15.6322330Z 2025-05-07T19:46:15.6769093Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:15.6770745Z 2025-05-07T19:46:15.6927620Z 2025-05-07T19:46:15.6929193Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:15.6930003Z 2025-05-07T19:46:15.6930007Z 2025-05-07T19:46:15.6930010Z 2025-05-07T19:46:15.6930014Z 2025-05-07T19:46:15.6930017Z 2025-05-07T19:46:15.6930021Z 2025-05-07T19:46:15.6930025Z 2025-05-07T19:46:15.6930028Z 2025-05-07T19:46:15.6930031Z 2025-05-07T19:46:15.6930046Z 2025-05-07T19:46:15.7067453Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:15.7068628Z 2025-05-07T19:46:15.7068644Z 2025-05-07T19:46:15.7068656Z 2025-05-07T19:46:15.7068666Z 2025-05-07T19:46:15.7068677Z 2025-05-07T19:46:15.7068718Z 2025-05-07T19:46:15.7068742Z 2025-05-07T19:46:15.7068753Z 2025-05-07T19:46:15.7068800Z 2025-05-07T19:46:15.7124817Z libcurand-10.3.7.77 | 39.9 MB | ########4 | 84%  2025-05-07T19:46:15.7126233Z 2025-05-07T19:46:15.7126247Z 2025-05-07T19:46:15.7126257Z 2025-05-07T19:46:15.7126304Z 
2025-05-07T19:46:15.7126315Z 2025-05-07T19:46:15.7126325Z 2025-05-07T19:46:15.7126336Z 2025-05-07T19:46:15.7126367Z 2025-05-07T19:46:15.7126928Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:15.7127226Z 2025-05-07T19:46:15.7127229Z 2025-05-07T19:46:15.7127233Z 2025-05-07T19:46:15.7127236Z 2025-05-07T19:46:15.7127240Z 2025-05-07T19:46:15.7127243Z 2025-05-07T19:46:15.7127247Z 2025-05-07T19:46:15.7127250Z 2025-05-07T19:46:15.7934616Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:15.7935049Z 2025-05-07T19:46:15.7935054Z 2025-05-07T19:46:15.7935058Z 2025-05-07T19:46:15.7935078Z 2025-05-07T19:46:15.7935092Z 2025-05-07T19:46:15.7935096Z 2025-05-07T19:46:15.7935099Z 2025-05-07T19:46:15.7935102Z 2025-05-07T19:46:15.7935121Z 2025-05-07T19:46:15.7935124Z 2025-05-07T19:46:15.8575291Z gds-tools-1.11.1.6 | 37.8 MB | #2 | 12%  2025-05-07T19:46:15.8575625Z 2025-05-07T19:46:15.8575629Z 2025-05-07T19:46:15.8575633Z 2025-05-07T19:46:15.8575636Z 2025-05-07T19:46:15.8575640Z 2025-05-07T19:46:15.8575656Z 2025-05-07T19:46:15.8575660Z 2025-05-07T19:46:15.8575664Z 2025-05-07T19:46:15.8575668Z 2025-05-07T19:46:15.8575671Z 2025-05-07T19:46:15.8575675Z 2025-05-07T19:46:15.8968207Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:15.8968626Z 2025-05-07T19:46:15.8968635Z 2025-05-07T19:46:15.8968662Z 2025-05-07T19:46:15.8968670Z 2025-05-07T19:46:15.8968678Z 2025-05-07T19:46:15.8968684Z 2025-05-07T19:46:15.8968692Z 2025-05-07T19:46:15.8968700Z 2025-05-07T19:46:15.8968708Z 2025-05-07T19:46:15.8968738Z 2025-05-07T19:46:15.9575735Z gds-tools-1.11.1.6 | 37.8 MB | ##3 | 23%  2025-05-07T19:46:15.9576251Z 2025-05-07T19:46:15.9576256Z 2025-05-07T19:46:15.9576259Z 2025-05-07T19:46:15.9576263Z 2025-05-07T19:46:15.9576266Z 2025-05-07T19:46:15.9576270Z 2025-05-07T19:46:15.9576273Z 2025-05-07T19:46:15.9576277Z 2025-05-07T19:46:15.9576280Z 2025-05-07T19:46:15.9576284Z 2025-05-07T19:46:15.9576287Z 2025-05-07T19:46:15.9985438Z cuda-nvcc-tools-12.6 | 23.0 MB | 
###4 | 34%  2025-05-07T19:46:15.9985803Z 2025-05-07T19:46:15.9985808Z 2025-05-07T19:46:15.9985812Z 2025-05-07T19:46:15.9985815Z 2025-05-07T19:46:15.9985819Z 2025-05-07T19:46:15.9985822Z 2025-05-07T19:46:15.9985826Z 2025-05-07T19:46:15.9985830Z 2025-05-07T19:46:15.9985834Z 2025-05-07T19:46:15.9985837Z 2025-05-07T19:46:16.0578211Z gds-tools-1.11.1.6 | 37.8 MB | ###3 | 33%  2025-05-07T19:46:16.0578820Z 2025-05-07T19:46:16.0578838Z 2025-05-07T19:46:16.0578842Z 2025-05-07T19:46:16.0578845Z 2025-05-07T19:46:16.0578849Z 2025-05-07T19:46:16.0578852Z 2025-05-07T19:46:16.0578856Z 2025-05-07T19:46:16.0578859Z 2025-05-07T19:46:16.0578863Z 2025-05-07T19:46:16.0578866Z 2025-05-07T19:46:16.0578870Z 2025-05-07T19:46:16.1114636Z cuda-nvcc-tools-12.6 | 23.0 MB | #######6 | 76%  2025-05-07T19:46:16.1114992Z 2025-05-07T19:46:16.1528078Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:16.1528591Z 2025-05-07T19:46:16.1528596Z 2025-05-07T19:46:16.1528599Z 2025-05-07T19:46:16.1528602Z 2025-05-07T19:46:16.1528606Z 2025-05-07T19:46:16.1528609Z 2025-05-07T19:46:16.1528612Z 2025-05-07T19:46:16.1528617Z 2025-05-07T19:46:16.1528620Z 2025-05-07T19:46:16.1528624Z 2025-05-07T19:46:16.1646153Z gds-tools-1.11.1.6 | 37.8 MB | ####2 | 42%  2025-05-07T19:46:16.1646622Z 2025-05-07T19:46:16.1646631Z 2025-05-07T19:46:16.1646658Z 2025-05-07T19:46:16.1646674Z 2025-05-07T19:46:16.1646681Z 2025-05-07T19:46:16.1646689Z 2025-05-07T19:46:16.1646695Z 2025-05-07T19:46:16.1646702Z 2025-05-07T19:46:16.1646709Z 2025-05-07T19:46:16.1646717Z 2025-05-07T19:46:16.1646723Z 2025-05-07T19:46:16.1646730Z 2025-05-07T19:46:16.2083216Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:16.2083577Z 2025-05-07T19:46:16.2083582Z 2025-05-07T19:46:16.2083586Z 2025-05-07T19:46:16.2083590Z 2025-05-07T19:46:16.2083643Z 2025-05-07T19:46:16.2083646Z 2025-05-07T19:46:16.2083650Z 2025-05-07T19:46:16.2083653Z 2025-05-07T19:46:16.2083657Z 2025-05-07T19:46:16.2642845Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  
2025-05-07T19:46:16.2643361Z 2025-05-07T19:46:16.2643366Z 2025-05-07T19:46:16.2643369Z 2025-05-07T19:46:16.2643372Z 2025-05-07T19:46:16.2643376Z 2025-05-07T19:46:16.2643379Z 2025-05-07T19:46:16.2643383Z 2025-05-07T19:46:16.2643403Z 2025-05-07T19:46:16.2643418Z 2025-05-07T19:46:16.2643421Z 2025-05-07T19:46:16.2786515Z gds-tools-1.11.1.6 | 37.8 MB | ##### | 50%  2025-05-07T19:46:16.2787776Z 2025-05-07T19:46:16.2787790Z 2025-05-07T19:46:16.2787801Z 2025-05-07T19:46:16.2787811Z 2025-05-07T19:46:16.2787821Z 2025-05-07T19:46:16.2787832Z 2025-05-07T19:46:16.2787842Z 2025-05-07T19:46:16.2787852Z 2025-05-07T19:46:16.2787862Z 2025-05-07T19:46:16.2787872Z 2025-05-07T19:46:16.2787883Z 2025-05-07T19:46:16.2787894Z 2025-05-07T19:46:16.2787920Z 2025-05-07T19:46:16.2853306Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:16.2853842Z 2025-05-07T19:46:16.2853847Z 2025-05-07T19:46:16.2853850Z 2025-05-07T19:46:16.2853854Z 2025-05-07T19:46:16.2853857Z 2025-05-07T19:46:16.2853861Z 2025-05-07T19:46:16.2853864Z 2025-05-07T19:46:16.2853868Z 2025-05-07T19:46:16.2853871Z 2025-05-07T19:46:16.2853874Z 2025-05-07T19:46:16.2853906Z 2025-05-07T19:46:16.2853924Z 2025-05-07T19:46:16.3242696Z cuda-nvrtc-12.6.85 | 17.3 MB | ### | 30%  2025-05-07T19:46:16.3243202Z 2025-05-07T19:46:16.3243207Z 2025-05-07T19:46:16.3243211Z 2025-05-07T19:46:16.3243215Z 2025-05-07T19:46:16.3243218Z 2025-05-07T19:46:16.3243246Z 2025-05-07T19:46:16.3243249Z 2025-05-07T19:46:16.3243253Z 2025-05-07T19:46:16.3243256Z 2025-05-07T19:46:16.3243260Z 2025-05-07T19:46:16.3243263Z 2025-05-07T19:46:16.3593452Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:16.3593897Z 2025-05-07T19:46:16.3593933Z 2025-05-07T19:46:16.3593942Z 2025-05-07T19:46:16.3593948Z 2025-05-07T19:46:16.3593955Z 2025-05-07T19:46:16.3593962Z 2025-05-07T19:46:16.3593970Z 2025-05-07T19:46:16.3593975Z 2025-05-07T19:46:16.3593983Z 2025-05-07T19:46:16.3593991Z 2025-05-07T19:46:16.3593996Z 2025-05-07T19:46:16.3594004Z 
2025-05-07T19:46:16.3785983Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%
2025-05-07T19:46:16.3814423Z libnvjitlink-12.6.85 | 14.9 MB | ###2 | 32%
2025-05-07T19:46:16.3855348Z gds-tools-1.11.1.6 | 37.8 MB | #####8 | 58%
2025-05-07T19:46:16.4594130Z cuda-nvrtc-12.6.85 | 17.3 MB | ######4 | 64%
2025-05-07T19:46:16.4788371Z cuda-nvcc-dev_linux- | 10.8 MB | #####4 | 55%
2025-05-07T19:46:16.4860432Z libnvjitlink-12.6.85 | 14.9 MB | #####8 | 59%
2025-05-07T19:46:16.5067881Z cuda-nvrtc-12.6.85 | 17.3 MB | #########6 | 97%
2025-05-07T19:46:16.5799394Z gds-tools-1.11.1.6 | 37.8 MB | ######5 | 65%
2025-05-07T19:46:16.6187173Z libnvjitlink-12.6.85 | 14.9 MB | #########7 | 98%
2025-05-07T19:46:16.6593375Z gds-tools-1.11.1.6 | 37.8 MB | #######4 | 75%
2025-05-07T19:46:16.6594570Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:46:16.6906116Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:46:16.6908264Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:46:16.7188035Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%
2025-05-07T19:46:16.7385240Z gds-tools-1.11.1.6 | 37.8 MB | ########2 | 82%
2025-05-07T19:46:16.7566918Z cuda-sanitizer-api-1 | 8.9 MB | | 0%
2025-05-07T19:46:16.7906744Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:46:16.8064471Z cuda-nvvm-tools-12.6 | 10.4 MB | #####9 | 59%
2025-05-07T19:46:16.8189718Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%
2025-05-07T19:46:16.8395909Z gds-tools-1.11.1.6 | 37.8 MB | #########7 | 98%
2025-05-07T19:46:16.8518051Z cuda-sanitizer-api-1 | 8.9 MB | ######4 | 65%
2025-05-07T19:46:16.9065378Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%
2025-05-07T19:46:17.0111913Z cuda-nvvm-impl-12.6. | 7.7 MB | #######1 | 72%
2025-05-07T19:46:17.0172592Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:46:17.0173664Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:46:17.0228759Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:46:17.0507661Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:46:17.0516758Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%
2025-05-07T19:46:17.1359962Z ... (more hidden) ...
2025-05-07T19:46:17.1504859Z ... (more hidden) ...
2025-05-07T19:46:17.2894765Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:46:17.3220363Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%
2025-05-07T19:46:17.4920909Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:46:17.7293272Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%
2025-05-07T19:46:17.8271142Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%
2025-05-07T19:46:18.0546857Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:46:18.1296342Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:46:18.3082991Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%
2025-05-07T19:46:18.3839820Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:46:18.5168887Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:46:18.5711704Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:46:18.6294666Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:46:18.6295897Z ... (more hidden) ...
2025-05-07T19:46:18.6472930Z ... (more hidden) ...
2025-05-07T19:46:18.8104733Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:46:18.8106084Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:46:19.0700474Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:46:19.6260321Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:46:20.7607783Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:46:23.3522516Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%
2025-05-07T19:46:23.3533758Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:46:23.3570759Z  2025-05-07T19:46:23.3570900Z 2025-05-07T19:46:23.3570903Z 2025-05-07T19:46:23.3570907Z 2025-05-07T19:46:23.3570910Z 2025-05-07T19:46:23.3570914Z 2025-05-07T19:46:23.3570917Z 2025-05-07T19:46:23.3571026Z  2025-05-07T19:46:23.3571171Z 2025-05-07T19:46:23.3571174Z 2025-05-07T19:46:23.3571178Z 2025-05-07T19:46:23.3571181Z 2025-05-07T19:46:23.3571185Z 2025-05-07T19:46:23.3571188Z 2025-05-07T19:46:23.3571192Z 2025-05-07T19:46:23.3571302Z  2025-05-07T19:46:23.3571445Z 2025-05-07T19:46:23.3571449Z 2025-05-07T19:46:23.3571452Z 2025-05-07T19:46:23.3571456Z 2025-05-07T19:46:23.3571476Z 2025-05-07T19:46:23.3571479Z 2025-05-07T19:46:23.3571483Z 2025-05-07T19:46:23.3571486Z 2025-05-07T19:46:23.3571605Z  2025-05-07T19:46:23.3571763Z 2025-05-07T19:46:23.3571770Z 2025-05-07T19:46:23.3571773Z 2025-05-07T19:46:23.3571777Z 2025-05-07T19:46:23.3571780Z 2025-05-07T19:46:23.3571784Z 2025-05-07T19:46:23.3571787Z 2025-05-07T19:46:23.3571804Z 2025-05-07T19:46:23.3571808Z 2025-05-07T19:46:23.3571927Z  2025-05-07T19:46:23.3572087Z 2025-05-07T19:46:23.3572090Z 2025-05-07T19:46:23.3572093Z 2025-05-07T19:46:23.3572097Z 2025-05-07T19:46:23.3572100Z 2025-05-07T19:46:23.3572104Z 2025-05-07T19:46:23.3572107Z 2025-05-07T19:46:23.3572111Z 2025-05-07T19:46:23.3572115Z 2025-05-07T19:46:23.3572132Z 2025-05-07T19:46:23.3572254Z  2025-05-07T19:46:23.3572419Z 2025-05-07T19:46:23.3572422Z 2025-05-07T19:46:23.3572426Z 2025-05-07T19:46:23.3572429Z 2025-05-07T19:46:23.3572432Z 2025-05-07T19:46:23.3572436Z 2025-05-07T19:46:23.3572439Z 2025-05-07T19:46:23.3572442Z 2025-05-07T19:46:23.3572446Z 2025-05-07T19:46:23.3572449Z 2025-05-07T19:46:23.3572467Z 2025-05-07T19:46:23.3572600Z  2025-05-07T19:46:23.3572904Z 2025-05-07T19:46:23.3572907Z 2025-05-07T19:46:23.3572910Z 2025-05-07T19:46:23.3572914Z 2025-05-07T19:46:23.3572917Z 2025-05-07T19:46:23.3572921Z 2025-05-07T19:46:23.3572924Z 2025-05-07T19:46:23.3572927Z 2025-05-07T19:46:23.3572931Z 2025-05-07T19:46:23.3572934Z 
2025-05-07T19:46:23.3572954Z 2025-05-07T19:46:23.3572957Z 2025-05-07T19:46:23.3573091Z  2025-05-07T19:46:23.3573275Z 2025-05-07T19:46:23.3573278Z 2025-05-07T19:46:23.3573282Z 2025-05-07T19:46:23.3573285Z 2025-05-07T19:46:23.3573289Z 2025-05-07T19:46:23.3573292Z 2025-05-07T19:46:23.3573296Z 2025-05-07T19:46:23.3573299Z 2025-05-07T19:46:23.3573302Z 2025-05-07T19:46:23.3573321Z 2025-05-07T19:46:23.3573325Z 2025-05-07T19:46:23.3573328Z 2025-05-07T19:46:23.3573331Z 2025-05-07T19:46:23.3573468Z  2025-05-07T19:46:23.3573661Z 2025-05-07T19:46:23.3573664Z 2025-05-07T19:46:23.3573748Z 2025-05-07T19:46:23.3573756Z 2025-05-07T19:46:23.3573759Z 2025-05-07T19:46:23.3573763Z 2025-05-07T19:46:23.3573781Z 2025-05-07T19:46:23.3573784Z 2025-05-07T19:46:23.3573787Z 2025-05-07T19:46:23.3573791Z 2025-05-07T19:46:23.3573794Z 2025-05-07T19:46:23.3573797Z 2025-05-07T19:46:23.3573801Z 2025-05-07T19:46:23.3573804Z 2025-05-07T19:46:23.3573948Z  2025-05-07T19:46:23.3574145Z 2025-05-07T19:46:23.3574149Z 2025-05-07T19:46:23.3574152Z 2025-05-07T19:46:23.3574171Z 2025-05-07T19:46:23.3574174Z 2025-05-07T19:46:23.3574178Z 2025-05-07T19:46:23.3574181Z 2025-05-07T19:46:23.3574184Z 2025-05-07T19:46:23.3574188Z 2025-05-07T19:46:23.3574191Z 2025-05-07T19:46:23.3574194Z 2025-05-07T19:46:23.3574198Z 2025-05-07T19:46:23.3574201Z 2025-05-07T19:46:23.3574205Z 2025-05-07T19:46:23.3574208Z 2025-05-07T19:46:23.3574352Z  2025-05-07T19:46:23.3574570Z 2025-05-07T19:46:23.3574573Z 2025-05-07T19:46:23.3574577Z 2025-05-07T19:46:23.3574584Z 2025-05-07T19:46:23.3574591Z 2025-05-07T19:46:23.3574594Z 2025-05-07T19:46:23.3574597Z 2025-05-07T19:46:23.3574601Z 2025-05-07T19:46:23.3574605Z 2025-05-07T19:46:23.3574608Z 2025-05-07T19:46:23.3574612Z 2025-05-07T19:46:23.3574616Z 2025-05-07T19:46:23.3574619Z 2025-05-07T19:46:23.3574622Z 2025-05-07T19:46:23.3574626Z 2025-05-07T19:46:23.3574629Z 2025-05-07T19:46:23.3574780Z  2025-05-07T19:46:23.3575005Z 2025-05-07T19:46:23.3575009Z 2025-05-07T19:46:23.3575013Z 
2025-05-07T19:46:23.3575016Z 2025-05-07T19:46:23.3575019Z 2025-05-07T19:46:23.3575023Z 2025-05-07T19:46:23.3575026Z 2025-05-07T19:46:23.3575030Z 2025-05-07T19:46:23.3575033Z 2025-05-07T19:46:23.3575037Z 2025-05-07T19:46:23.3575041Z 2025-05-07T19:46:23.3575045Z 2025-05-07T19:46:23.3575048Z 2025-05-07T19:46:23.3575052Z 2025-05-07T19:46:23.3575055Z 2025-05-07T19:46:23.3575059Z 2025-05-07T19:46:23.3575062Z 2025-05-07T19:46:23.3575242Z  2025-05-07T19:46:23.3575461Z 2025-05-07T19:46:23.3575465Z 2025-05-07T19:46:23.3575468Z 2025-05-07T19:46:23.3575472Z 2025-05-07T19:46:23.3575475Z 2025-05-07T19:46:23.3575479Z 2025-05-07T19:46:23.3575482Z 2025-05-07T19:46:23.3575486Z 2025-05-07T19:46:23.3575490Z 2025-05-07T19:46:23.3575493Z 2025-05-07T19:46:23.3575515Z 2025-05-07T19:46:23.3575518Z 2025-05-07T19:46:23.3575522Z 2025-05-07T19:46:23.3575525Z 2025-05-07T19:46:23.3575528Z 2025-05-07T19:46:23.3575532Z 2025-05-07T19:46:23.3575535Z 2025-05-07T19:46:23.3575539Z 2025-05-07T19:46:23.3575704Z  2025-05-07T19:46:23.3575921Z 2025-05-07T19:46:23.3575924Z 2025-05-07T19:46:23.3576036Z  2025-05-07T19:46:23.3576139Z 2025-05-07T19:46:23.3576143Z 2025-05-07T19:46:23.3576238Z  2025-05-07T19:46:23.3576357Z 2025-05-07T19:46:23.3576361Z 2025-05-07T19:46:23.3576364Z 2025-05-07T19:46:23.3576558Z  2025-05-07T19:46:23.3576672Z 2025-05-07T19:46:23.3576679Z 2025-05-07T19:46:23.3576746Z 2025-05-07T19:46:23.3576750Z 2025-05-07T19:46:23.3576873Z  2025-05-07T19:46:23.3576989Z 2025-05-07T19:46:23.3576992Z 2025-05-07T19:46:23.3576996Z 2025-05-07T19:46:23.3576999Z 2025-05-07T19:46:23.3577003Z 2025-05-07T19:46:23.3577106Z  2025-05-07T19:46:23.3577244Z 2025-05-07T19:46:23.3577248Z 2025-05-07T19:46:23.3577251Z 2025-05-07T19:46:23.3577255Z 2025-05-07T19:46:23.3577258Z 2025-05-07T19:46:23.3577262Z 2025-05-07T19:46:23.3577368Z  2025-05-07T19:46:23.3577512Z 2025-05-07T19:46:23.3577516Z 2025-05-07T19:46:23.3577519Z 2025-05-07T19:46:23.3577522Z 2025-05-07T19:46:23.3577526Z 2025-05-07T19:46:23.3577529Z 
2025-05-07T19:46:23.3577533Z 2025-05-07T19:46:23.3577643Z  2025-05-07T19:46:23.3577783Z 2025-05-07T19:46:23.3577787Z 2025-05-07T19:46:23.3577790Z 2025-05-07T19:46:23.3577794Z 2025-05-07T19:46:23.3577812Z 2025-05-07T19:46:23.3577816Z 2025-05-07T19:46:23.3577819Z 2025-05-07T19:46:23.3577891Z 2025-05-07T19:46:23.3578014Z  2025-05-07T19:46:23.3578165Z 2025-05-07T19:46:23.3578168Z 2025-05-07T19:46:23.3578172Z 2025-05-07T19:46:23.3578175Z 2025-05-07T19:46:23.3578178Z 2025-05-07T19:46:23.3578182Z 2025-05-07T19:46:23.3578185Z 2025-05-07T19:46:23.3578202Z 2025-05-07T19:46:23.3578205Z 2025-05-07T19:46:23.3578324Z  2025-05-07T19:46:23.3578485Z 2025-05-07T19:46:23.3578489Z 2025-05-07T19:46:23.3578493Z 2025-05-07T19:46:23.3578496Z 2025-05-07T19:46:23.3578499Z 2025-05-07T19:46:23.3578503Z 2025-05-07T19:46:23.3578506Z 2025-05-07T19:46:23.3578509Z 2025-05-07T19:46:23.3578513Z 2025-05-07T19:46:23.3578530Z 2025-05-07T19:46:23.3578653Z  2025-05-07T19:46:23.3578818Z 2025-05-07T19:46:23.3578821Z 2025-05-07T19:46:23.3578824Z 2025-05-07T19:46:23.3578828Z 2025-05-07T19:46:23.3578831Z 2025-05-07T19:46:23.3578835Z 2025-05-07T19:46:23.3578838Z 2025-05-07T19:46:23.3578841Z 2025-05-07T19:46:23.3578848Z 2025-05-07T19:46:23.3578852Z 2025-05-07T19:46:23.3578872Z 2025-05-07T19:46:23.3578998Z  2025-05-07T19:46:23.3579172Z 2025-05-07T19:46:23.3579176Z 2025-05-07T19:46:23.3579179Z 2025-05-07T19:46:23.3579183Z 2025-05-07T19:46:23.3579186Z 2025-05-07T19:46:23.3579189Z 2025-05-07T19:46:23.3579193Z 2025-05-07T19:46:23.3579196Z 2025-05-07T19:46:23.3579200Z 2025-05-07T19:46:23.3579203Z 2025-05-07T19:46:23.3579220Z 2025-05-07T19:46:23.3579224Z 2025-05-07T19:46:23.3579355Z  2025-05-07T19:46:23.3579538Z 2025-05-07T19:46:23.3579541Z 2025-05-07T19:46:23.3579545Z 2025-05-07T19:46:23.3579548Z 2025-05-07T19:46:23.3579552Z 2025-05-07T19:46:23.3579556Z 2025-05-07T19:46:23.3579560Z 2025-05-07T19:46:23.3579563Z 2025-05-07T19:46:23.3579582Z 2025-05-07T19:46:23.3579588Z 2025-05-07T19:46:23.3579592Z 
2025-05-07T19:46:23.3579595Z 2025-05-07T19:46:23.3579598Z 2025-05-07T19:46:23.3579735Z  2025-05-07T19:46:23.3579929Z 2025-05-07T19:46:23.3579936Z 2025-05-07T19:46:23.3579940Z 2025-05-07T19:46:23.3579943Z 2025-05-07T19:46:23.3579947Z 2025-05-07T19:46:23.3579950Z 2025-05-07T19:46:23.3579970Z 2025-05-07T19:46:23.3579973Z 2025-05-07T19:46:23.3579977Z 2025-05-07T19:46:23.3579980Z 2025-05-07T19:46:23.3579984Z 2025-05-07T19:46:23.3579988Z 2025-05-07T19:46:23.3579991Z 2025-05-07T19:46:23.3579995Z 2025-05-07T19:46:23.3580140Z  2025-05-07T19:46:23.3580340Z 2025-05-07T19:46:23.3580344Z 2025-05-07T19:46:23.3580347Z 2025-05-07T19:46:23.3580366Z 2025-05-07T19:46:23.3580370Z 2025-05-07T19:46:23.3580373Z 2025-05-07T19:46:23.3580376Z 2025-05-07T19:46:23.3580380Z 2025-05-07T19:46:23.3580383Z 2025-05-07T19:46:23.3580387Z 2025-05-07T19:46:23.3580390Z 2025-05-07T19:46:23.3580394Z 2025-05-07T19:46:23.3580398Z 2025-05-07T19:46:23.3580401Z 2025-05-07T19:46:23.3580405Z 2025-05-07T19:46:23.3580550Z  2025-05-07T19:46:23.3580774Z 2025-05-07T19:46:23.3580829Z 2025-05-07T19:46:23.3580832Z 2025-05-07T19:46:23.3580835Z 2025-05-07T19:46:23.3580839Z 2025-05-07T19:46:23.3580842Z 2025-05-07T19:46:23.3580846Z 2025-05-07T19:46:23.3580849Z 2025-05-07T19:46:23.3580852Z 2025-05-07T19:46:23.3580856Z 2025-05-07T19:46:23.3580859Z 2025-05-07T19:46:23.3580863Z 2025-05-07T19:46:23.3580866Z 2025-05-07T19:46:23.3580869Z 2025-05-07T19:46:23.3580873Z 2025-05-07T19:46:23.3580876Z 2025-05-07T19:46:23.3581045Z  2025-05-07T19:46:23.3581256Z 2025-05-07T19:46:23.3581260Z 2025-05-07T19:46:23.3581263Z 2025-05-07T19:46:23.3581267Z 2025-05-07T19:46:23.3581270Z 2025-05-07T19:46:23.3581274Z 2025-05-07T19:46:23.3581278Z 2025-05-07T19:46:23.3581281Z 2025-05-07T19:46:23.3581285Z 2025-05-07T19:46:23.3581288Z 2025-05-07T19:46:23.3581292Z 2025-05-07T19:46:23.3581295Z 2025-05-07T19:46:23.3581299Z 2025-05-07T19:46:23.3581302Z 2025-05-07T19:46:23.3581305Z 2025-05-07T19:46:23.3581309Z 2025-05-07T19:46:23.3581378Z 
2025-05-07T19:46:23.3583515Z done 2025-05-07T19:46:23.5677168Z Preparing transaction: / - done 2025-05-07T19:46:24.2714891Z Verifying transaction: | / - \ | / - done 2025-05-07T19:46:24.5757682Z Executing transaction: | / - done 2025-05-07T19:46:26.3200405Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 
2025-05-07T19:46:26.3201282Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:26.3202026Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:26.3202637Z 2025-05-07T19:46:26.3221615Z 2025-05-07T19:46:26.3224120Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:26.3226427Z 2025-05-07T19:46:26.3244265Z 2025-05-07T19:46:26.3244800Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:26.3252040Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:26.3256332Z 2025-05-07T19:46:26.3458590Z 2025-05-07T19:46:26.3463422Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h 
/github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:26.3467328Z 2025-05-07T19:46:26.3478452Z 2025-05-07T19:46:26.3479622Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:26.3867857Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:28.0324298Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:28.0326510Z 2025-05-07T19:46:28.4441017Z 2025-05-07T19:46:28.4446752Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:28.4818381Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:28.4818924Z 2025-05-07T19:46:28.8972284Z 2025-05-07T19:46:28.8973929Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 
2025-05-07T19:46:28.8977359Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:28.8978953Z 2025-05-07T19:46:29.3154560Z 2025-05-07T19:46:31.0762189Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:32.8268511Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:34.5630575Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:34.5631469Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:36.2760792Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:37.8696971Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:37.8697453Z 2025-05-07T19:46:37.9451791Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:41.2290837Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:41.2291517Z Target: x86_64-conda-linux-gnu 2025-05-07T19:46:41.2291792Z Thread model: posix 2025-05-07T19:46:41.2292133Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:46:41.2292741Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:46:41.2293226Z 2025-05-07T19:46:41.2859841Z [INSTALL] Resetting compiler symlinks to clang ... 
2025-05-07T19:46:44.5910871Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:46:44.5911481Z 2025-05-07T19:46:44.5925484Z 2025-05-07T19:46:44.5947638Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:46:44.5948942Z 2025-05-07T19:46:44.5961578Z 2025-05-07T19:46:44.5992260Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:44.5992802Z 2025-05-07T19:46:44.6010722Z 2025-05-07T19:46:44.6034194Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:46:44.6034717Z 2025-05-07T19:46:44.6048632Z 2025-05-07T19:46:44.6049077Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:44.6049735Z 2025-05-07T19:46:44.6062318Z total 20 2025-05-07T19:46:44.6062586Z drwxr-xr-x. 2 root root 154 May 7 19:46 . 2025-05-07T19:46:44.6062925Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:44.6063351Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:44.6063803Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:44.6064228Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:46:44.6064645Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:46:44.6065056Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:44.6065342Z 2025-05-07T19:46:44.6065559Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 
2025-05-07T19:46:44.6066212Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:44.6066661Z 2025-05-07T19:46:44.6080664Z 2025-05-07T19:46:44.6080893Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:44.6081170Z 2025-05-07T19:46:46.3264259Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:46.3266814Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:46:46.3268992Z 2025-05-07T19:46:46.3269400Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:46:47.9701253Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:47.9703853Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:46:47.9705972Z 2025-05-07T19:46:48.3797050Z 2025-05-07T19:46:48.3797897Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:48.3798731Z 2025-05-07T19:46:49.9636751Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:49.9638331Z 2025-05-07T19:46:50.0204000Z 2025-05-07T19:46:50.0204822Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:46:50.0206292Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:50.0207293Z 2025-05-07T19:46:51.7174490Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:51.7174833Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:51.7175114Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:51.7175367Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:51.7175623Z #define ADJ_NANO 0x2000 2025-05-07T19:46:51.7175863Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:51.7176139Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:51.7176563Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:51.7176842Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:51.7177100Z #define ADJ_TAI 0x0080 2025-05-07T19:46:51.7177337Z #define ADJ_TICK 0x4000 2025-05-07T19:46:51.7177597Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:51.7177894Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:51.7178516Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:51.7178857Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:51.7179221Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:51.7179581Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:51.7179958Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:51.7180263Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:51.7180579Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:51.7180917Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:51.7181221Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:51.7181553Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.7181849Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:51.7182164Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:51.7182461Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:51.7182778Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:51.7183068Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:51.7183392Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:51.7183848Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:51.7184199Z #define CLOCK_PROCESS_CPUTIME_ID 2 
2025-05-07T19:46:51.7184543Z #define CLOCK_REALTIME 0 2025-05-07T19:46:51.7184814Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:51.7185150Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:51.7185432Z #define CLOCK_TAI 11 2025-05-07T19:46:51.7185702Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:51.7186011Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:51.7186320Z #define CUDARTAPI 2025-05-07T19:46:51.7186568Z #define CUDARTAPI_CDECL 2025-05-07T19:46:51.7186868Z #define CUDART_CB 2025-05-07T19:46:51.7187160Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:51.7187465Z #define CUDART_VERSION 12060 2025-05-07T19:46:51.7187807Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:51.7188132Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:51.7188461Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:51.7188767Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:51.7189098Z #define DOMAIN 1 2025-05-07T19:46:51.7189346Z #define EOF (-1) 2025-05-07T19:46:51.7189609Z #define EXIT_FAILURE 1 2025-05-07T19:46:51.7189869Z #define EXIT_SUCCESS 0 2025-05-07T19:46:51.7190152Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:51.7190529Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:51.7190918Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:51.7191332Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:51.7191677Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:51.7192017Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:51.7192359Z #define FILENAME_MAX 4096 2025-05-07T19:46:51.7192662Z #define FOPEN_MAX 16 2025-05-07T19:46:51.7192936Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:51.7193298Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:51.7193604Z #define FP_INFINITE 1 2025-05-07T19:46:51.7193895Z #define FP_NAN 0 2025-05-07T19:46:51.7194175Z #define FP_NORMAL 4 2025-05-07T19:46:51.7194424Z #define FP_SUBNORMAL 3 2025-05-07T19:46:51.7194711Z #define FP_ZERO 2 
2025-05-07T19:46:51.7194952Z #define HOST_NAME_MAX 64 2025-05-07T19:46:51.7195246Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:51.7195536Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:51.7195893Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:51.7196233Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:51.7196592Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:51.7196897Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:51.7197217Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:46:51.7197531Z #define IOV_MAX 1024 2025-05-07T19:46:51.7197793Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.7198537Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:51.7198857Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.7199210Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.7199540Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:51.7199841Z #define LONG_BIT 64 2025-05-07T19:46:51.7200103Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.7200558Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.7200929Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:51.7201228Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:46:51.7201588Z #define L_ctermid 9 2025-05-07T19:46:51.7201839Z #define L_cuserid 9 2025-05-07T19:46:51.7202116Z #define L_tmpnam 20 2025-05-07T19:46:51.7202359Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:51.7202614Z #define MATH_ERRNO 1 2025-05-07T19:46:51.7202839Z #define MAX_CANON 255 2025-05-07T19:46:51.7203084Z #define MAX_INPUT 255 2025-05-07T19:46:51.7203346Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:51.7203681Z #define MB_LEN_MAX 16 2025-05-07T19:46:51.7203927Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:51.7204234Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:51.7204503Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:51.7204787Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:51.7205095Z #define MOD_MAXERROR ADJ_MAXERROR 
2025-05-07T19:46:51.7205445Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:51.7205723Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:51.7205975Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:51.7206364Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:51.7206617Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:51.7206884Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:51.7207160Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:51.7207530Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:51.7207840Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:51.7208147Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:51.7208456Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:51.7208770Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:51.7209108Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:51.7209430Z #define M_E 2.7182818284590452354 2025-05-07T19:46:51.7209719Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:51.7210026Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:51.7210376Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:51.7210734Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:51.7211023Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:51.7211346Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:51.7211655Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:51.7211986Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:51.7212285Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:51.7212602Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:51.7212854Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:51.7213155Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:51.7213467Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:51.7213752Z #define M_PI_4l 0.785398163397448309615660845819875721L 
2025-05-07T19:46:51.7214094Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:51.7214401Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:51.7214729Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:51.7215042Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:51.7215353Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:51.7215653Z #define NAME_MAX 255 2025-05-07T19:46:51.7215891Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:51.7216164Z #define NFDBITS __NFDBITS 2025-05-07T19:46:51.7216460Z #define NGROUPS_MAX 65536 2025-05-07T19:46:51.7216717Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:51.7217172Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.7217494Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:51.7217845Z #define NL_NMAX INT_MAX 2025-05-07T19:46:51.7238528Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:51.7239065Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:51.7239334Z #define NULL __null 2025-05-07T19:46:51.7239554Z #define NZERO 20 2025-05-07T19:46:51.7239810Z #define OVERFLOW 3 2025-05-07T19:46:51.7240175Z #define PATH_MAX 4096 2025-05-07T19:46:51.7240443Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:51.7240724Z #define PIPE_BUF 4096 2025-05-07T19:46:51.7240948Z #define PLOSS 6 2025-05-07T19:46:51.7241316Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:51.7241744Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:51.7242028Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:51.7242292Z #define P_tmpdir "/tmp" 2025-05-07T19:46:51.7242548Z #define RAND_MAX 2147483647 2025-05-07T19:46:51.7242794Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:51.7243052Z #define RTSIG_MAX 32 2025-05-07T19:46:51.7243283Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.7243568Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:46:51.7243855Z #define SEEK_CUR 1 2025-05-07T19:46:51.7244067Z #define SEEK_DATA 3 
2025-05-07T19:46:51.7244292Z #define SEEK_END 2 2025-05-07T19:46:51.7244538Z #define SEEK_HOLE 4 2025-05-07T19:46:51.7246571Z #define SEEK_SET 0 2025-05-07T19:46:51.7246819Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:51.7247111Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:51.7247376Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:46:51.7247658Z #define SING 2 2025-05-07T19:46:51.7247868Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:51.7248120Z #define STA_CLK 0x8000 2025-05-07T19:46:51.7248351Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:51.7248611Z #define STA_DEL 0x0020 2025-05-07T19:46:51.7248849Z #define STA_FLL 0x0008 2025-05-07T19:46:51.7249077Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:51.7249332Z #define STA_INS 0x0010 2025-05-07T19:46:51.7249554Z #define STA_MODE 0x4000 2025-05-07T19:46:51.7249800Z #define STA_NANO 0x2000 2025-05-07T19:46:51.7250027Z #define STA_PLL 0x0001 2025-05-07T19:46:51.7250277Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:51.7250527Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:51.7250796Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:51.7251061Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:51.7251340Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:51.7251699Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:51.7252230Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:51.7252787Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:51.7253020Z #define TIMER_ABSTIME 1 2025-05-07T19:46:51.7253252Z #define TIME_UTC 1 2025-05-07T19:46:51.7253452Z #define TLOSS 5 2025-05-07T19:46:51.7253662Z #define TMP_MAX 238328 2025-05-07T19:46:51.7253878Z #define TTY_NAME_MAX 32 2025-05-07T19:46:51.7254124Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:46:51.7254401Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:46:51.7254717Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.7255074Z #define ULONG_LONG_MAX 
(__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.7255397Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:46:51.7255684Z #define UNDERFLOW 4 2025-05-07T19:46:51.7255910Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:46:51.7256190Z #define WCONTINUED 8 2025-05-07T19:46:51.7256516Z #define WEXITED 4 2025-05-07T19:46:51.7256992Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:51.7257553Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:51.7258031Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:51.7258496Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:51.7258953Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:51.7259329Z #define WNOHANG 1 2025-05-07T19:46:51.7259555Z #define WNOWAIT 0x01000000 2025-05-07T19:46:51.7259819Z #define WORD_BIT 32 2025-05-07T19:46:51.7260035Z #define WSTOPPED 2 2025-05-07T19:46:51.7260338Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.7260754Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.7261112Z #define WUNTRACED 2 2025-05-07T19:46:51.7261347Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:51.7261734Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:51.7262005Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:51.7262276Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:51.7262575Z #define _ACRTIMP 2025-05-07T19:46:51.7262788Z #define _ALLOCA_H 1 2025-05-07T19:46:51.7263018Z #define _ASSERT_H 1 2025-05-07T19:46:51.7263237Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:51.7263499Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:51.7263751Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:51.7264026Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:51.7264290Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:51.7264565Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:51.7264815Z #define _BITS_TIME_H 1 
2025-05-07T19:46:51.7265052Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:51.7265325Z #define _BITS_TYPES_H 1 2025-05-07T19:46:51.7265555Z #define _BSD_SOURCE 1 2025-05-07T19:46:51.7265806Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:51.7266127Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:51.7266400Z #define _CRTIMP 2025-05-07T19:46:51.7266615Z #define _CTYPE_H 1 2025-05-07T19:46:51.7266846Z #define _ENDIAN_H 1 2025-05-07T19:46:51.7267074Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:51.7267363Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:51.7267628Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:51.7267891Z #define _FEATURES_H 1 2025-05-07T19:46:51.7268144Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:51.7268386Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:51.7268680Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:51.7269145Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.7269601Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:51.7269896Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:51.7270375Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:51.7270664Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:51.7270969Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:51.7271265Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:51.7271592Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:51.7271934Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:51.7272392Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.7272844Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:51.7273124Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:51.7273412Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:51.7273718Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.7274051Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:51.7274339Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:51.7274634Z #define 
_GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:51.7274930Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:51.7275213Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:51.7275594Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:51.7275994Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:51.7276323Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:51.7276647Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:51.7276972Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:51.7277347Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:51.7277732Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:51.7278169Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:51.7278619Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:51.7278945Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:51.7279229Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:51.7279515Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:51.7279799Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:51.7280157Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:51.7280451Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:51.7280727Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:51.7281002Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:51.7281432Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:51.7281769Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:51.7282088Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:51.7282510Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:51.7282814Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:51.7283151Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:51.7283513Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:51.7283937Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.7284507Z #define 
_GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:51.7285014Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:51.7285333Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:51.7285606Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:51.7286014Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:51.7286331Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:51.7286643Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:51.7287035Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:51.7287467Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:51.7287782Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:51.7288060Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:51.7288442Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:51.7288814Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:51.7289179Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:51.7289463Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:51.7289742Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:51.7290560Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_##_NTYPE : false_type { }; template<typename _Tp> struct __has_##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:46:51.7291528Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:51.7291797Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:51.7292054Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:51.7292349Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:51.7292632Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:51.7292876Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:51.7293162Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:51.7293453Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:51.7293718Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:51.7293963Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:51.7294223Z #define
_GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:51.7294486Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:51.7294810Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:51.7295111Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:51.7295427Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:51.7295766Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:51.7296100Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:51.7296509Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:51.7296958Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:51.7297267Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:51.7297575Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:51.7297858Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:51.7298139Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:51.7298417Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:51.7298696Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:51.7298957Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:51.7299240Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:51.7299517Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:51.7299806Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:51.7300117Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:51.7300472Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:51.7300762Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:51.7301126Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:51.7301387Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:51.7301668Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:51.7301951Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:51.7302222Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:51.7302508Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:51.7302770Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:51.7303042Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:51.7303308Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:51.7303587Z #define 
_GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:51.7303851Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:51.7304129Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:51.7304390Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:51.7304671Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:51.7304924Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:51.7305186Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:51.7305510Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:51.7305774Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:51.7306027Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:51.7306286Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:51.7306541Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:51.7306790Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:51.7307060Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:51.7307345Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:51.7307644Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:51.7307908Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:51.7308167Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:51.7308409Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:51.7308656Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:51.7308902Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:51.7309259Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:51.7309537Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:51.7309809Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:51.7310079Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:51.7310341Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:51.7310624Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:51.7310903Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:51.7311192Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:51.7311468Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:51.7311754Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:51.7312017Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:51.7312300Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 
2025-05-07T19:46:51.7312599Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:51.7312880Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:51.7313153Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:51.7313408Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:51.7313670Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:51.7313922Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:51.7314189Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:51.7314457Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:51.7314728Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:51.7314993Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:51.7315243Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:51.7315511Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:51.7315768Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:51.7316150Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:51.7316419Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:51.7316708Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:51.7316987Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:51.7317273Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:51.7317516Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:51.7317785Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:51.7318084Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:51.7318360Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:51.7318617Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:51.7318868Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:51.7319219Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:51.7319473Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:51.7319738Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:51.7319990Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:51.7320254Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:51.7320502Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:51.7320763Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:51.7321025Z #define _GLIBCXX_HAVE_SINHL 1 
2025-05-07T19:46:51.7321266Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:51.7321523Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:51.7321781Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:51.7322036Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:51.7322284Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:51.7322558Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:51.7322817Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:51.7323085Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:51.7323398Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:51.7323681Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:51.7323954Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:51.7324213Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:51.7324479Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:51.7324726Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:51.7325012Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:51.7325307Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:51.7325582Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:51.7325907Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:51.7326278Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:51.7326545Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:51.7326821Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:51.7327105Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:51.7327379Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:51.7327657Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:51.7327932Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:51.7328219Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:51.7328494Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:51.7328782Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:51.7329048Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:51.7329328Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:51.7329602Z #define _GLIBCXX_HAVE_S_ISREG 1 
2025-05-07T19:46:51.7329850Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:51.7330111Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:51.7330355Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:51.7330611Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:51.7330858Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:51.7331122Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:51.7331364Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:51.7331635Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:51.7331891Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:51.7332168Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:51.7332438Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:51.7332691Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:51.7332960Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:51.7333208Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:51.7333646Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:51.7333947Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:51.7334226Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:51.7334555Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:51.7334834Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:51.7335083Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:51.7335360Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:51.7335653Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:51.7336140Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:51.7337001Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:51.7337422Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:51.7337825Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:51.7338100Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:51.7338498Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:51.7338993Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:51.7339461Z #define 
_GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:51.7339784Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:51.7340153Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:51.7340725Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:51.7341220Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:51.7341554Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:51.7341895Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:51.7342273Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:51.7342657Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:51.7343051Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:51.7343442Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:51.7343864Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:51.7344280Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:51.7344561Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:51.7344852Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:51.7345177Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:51.7345608Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:51.7346030Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:51.7346346Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:51.7346697Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:51.7347066Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:51.7347381Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:51.7347709Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:51.7348049Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:51.7348307Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:51.7348587Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:51.7348857Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:51.7349241Z 
#define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:51.7349514Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:51.7349771Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:51.7350020Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:51.7350256Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:51.7350500Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:51.7350731Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:51.7351031Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:51.7351382Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:51.7351708Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:51.7351996Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:51.7352341Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:51.7352657Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:51.7352942Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:51.7353237Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:51.7353513Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:51.7353792Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:51.7354094Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:51.7354422Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:51.7354727Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:51.7355022Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:51.7355319Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:51.7355617Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:51.7355922Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:51.7356160Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:51.7356424Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:51.7356681Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:51.7357028Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:51.7357325Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:51.7357687Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 
2025-05-07T19:46:51.7357982Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:51.7358252Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:51.7358539Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:51.7358841Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:51.7359196Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:51.7359513Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:51.7359796Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:51.7360111Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:51.7360498Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:51.7360884Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:51.7361267Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:51.7361569Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:51.7361852Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:51.7362161Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:51.7362427Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:51.7362717Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:51.7363152Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:51.7363439Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:51.7363700Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:51.7363975Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:51.7364237Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:51.7364518Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:51.7364804Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:51.7365091Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:51.7365351Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:51.7365624Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:51.7365907Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:51.7366169Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:51.7366467Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:51.7366767Z #define 
_GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:51.7367084Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:51.7367360Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:51.7367649Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:51.7367942Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:51.7368268Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:51.7368558Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:51.7368845Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:51.7369197Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include(<tbb/tbb.h>) 2025-05-07T19:46:51.7369562Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:51.7369834Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:51.7370223Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:51.7370675Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:51.7370959Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:51.7371283Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:51.7371574Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:51.7371943Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:51.7372362Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:51.7372639Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:51.7372922Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:51.7373174Z #define _GNU_SOURCE 1 2025-05-07T19:46:51.7373440Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:51.7373725Z #define _G_BUFSIZ 8192 2025-05-07T19:46:51.7373974Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:51.7374207Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:51.7374525Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:51.7374886Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:51.7375182Z #define _G_config_h 1 2025-05-07T19:46:51.7375435Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:51.7375710Z #define _INITIALIZER_LIST 2025-05-07T19:46:51.7376092Z #define _IOFBF 0 2025-05-07T19:46:51.7376306Z #define _IOLBF 1 2025-05-07T19:46:51.7376590Z #define _IONBF 2
2025-05-07T19:46:51.7376808Z #define _IOS_APPEND 8 2025-05-07T19:46:51.7377057Z #define _IOS_ATEND 4 2025-05-07T19:46:51.7377284Z #define _IOS_BIN 128 2025-05-07T19:46:51.7377528Z #define _IOS_INPUT 1 2025-05-07T19:46:51.7377760Z #define _IOS_NOCREATE 32 2025-05-07T19:46:51.7378022Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:51.7378266Z #define _IOS_OUTPUT 2 2025-05-07T19:46:51.7378508Z #define _IOS_TRUNC 16 2025-05-07T19:46:51.7378755Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:51.7379064Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:51.7379425Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:51.7379690Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:51.7379971Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:51.7380244Z #define _IO_DEC 020 2025-05-07T19:46:51.7380490Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:51.7380856Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:51.7381139Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:51.7381383Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:51.7381644Z #define _IO_FIXED 010000 2025-05-07T19:46:51.7381902Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:51.7382152Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:51.7382431Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:51.7382724Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:51.7383044Z #define _IO_HEX 0100 2025-05-07T19:46:51.7383273Z #define _IO_INTERNAL 010 2025-05-07T19:46:51.7383534Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:51.7383797Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:51.7384077Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:51.7384328Z #define _IO_LEFT 02 2025-05-07T19:46:51.7384562Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:51.7384822Z #define _IO_LINKED 0x80 2025-05-07T19:46:51.7385063Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:51.7385341Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:51.7385613Z #define _IO_NO_READS 4 2025-05-07T19:46:51.7385862Z #define _IO_NO_WRITES 8 
2025-05-07T19:46:51.7386090Z #define _IO_OCT 040 2025-05-07T19:46:51.7386474Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:51.7386908Z #define _IO_RIGHT 04 2025-05-07T19:46:51.7387157Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:51.7387418Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:51.7387687Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:51.7387959Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:51.7388203Z #define _IO_SKIPWS 01 2025-05-07T19:46:51.7388554Z #define _IO_STDIO 040000 2025-05-07T19:46:51.7388884Z #define _IO_STDIO_H 2025-05-07T19:46:51.7389117Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:51.7389359Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:51.7389607Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:51.7389882Z #define _IO_UNITBUF 020000 2025-05-07T19:46:51.7390167Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:51.7390434Z #define _IO_USER_BUF 1 2025-05-07T19:46:51.7390716Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:51.7390994Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:51.7391348Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:51.7391777Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:51.7392267Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:51.7392705Z #define _IO_file_flags _flags 2025-05-07T19:46:51.7392957Z #define _IO_flockfile(_fp) 2025-05-07T19:46:51.7393246Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:51.7393531Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:51.7393827Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:51.7394079Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:51.7394597Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:51.7395134Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:51.7395385Z #define _IO_off64_t __off64_t 2025-05-07T19:46:51.7395706Z #define _IO_off_t __off_t 2025-05-07T19:46:51.7395970Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:51.7396580Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:51.7397133Z #define _IO_pid_t __pid_t 2025-05-07T19:46:51.7397912Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:51.7398574Z #define _IO_size_t size_t 2025-05-07T19:46:51.7398821Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:51.7399132Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:51.7399482Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:51.7399842Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:51.7400229Z #define _IO_uid_t __uid_t 2025-05-07T19:46:51.7400508Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:51.7400792Z #define _IO_wint_t wint_t 2025-05-07T19:46:51.7401027Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:51.7401277Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:51.7401507Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:51.7401845Z #define _ISbit(bit) ((bit) < 8 ? 
((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:51.7402220Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:51.7402496Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:51.7402740Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:51.7402993Z #define _LINUX_LIMITS_H 2025-05-07T19:46:51.7403225Z #define _LP64 1 2025-05-07T19:46:51.7403442Z #define _MATH_H 1 2025-05-07T19:46:51.7403662Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:51.7403949Z #define _MOVE_H 1 2025-05-07T19:46:51.7404223Z #define _Mfloat_ float 2025-05-07T19:46:51.7404499Z #define _Mlong_double_ long double 2025-05-07T19:46:51.7404828Z #define _NEW 2025-05-07T19:46:51.7405074Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:51.7405405Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:51.7405691Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:51.7406009Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:51.7406300Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:51.7406643Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:51.7406958Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:51.7407290Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:51.7407617Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:51.7407905Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:51.7408220Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:51.7408503Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:51.7408795Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:51.7409070Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:51.7409380Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:51.7409687Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:51.7410007Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:51.7410418Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:51.7410770Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:51.7411097Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:51.7411370Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:51.7411661Z #define _POSIX_LOGIN_NAME_MAX 9 
2025-05-07T19:46:51.7411934Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:51.7412229Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:51.7412498Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:51.7412790Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:51.7413057Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:51.7413342Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:51.7413609Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:51.7413897Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:51.7414191Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:51.7414452Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:51.7414728Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:51.7414995Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:51.7415288Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:51.7415639Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:51.7415953Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:51.7416217Z #define _POSIX_SOURCE 1 2025-05-07T19:46:51.7416579Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:51.7417010Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:51.7417320Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:51.7417650Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:51.7417969Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:51.7418349Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:51.7418665Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:51.7419006Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:51.7419291Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:51.7419617Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:51.7419899Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:51.7420295Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.7420868Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.7421535Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:46:51.7422066Z #define _PSTL_CONFIG_H 
2025-05-07T19:46:51.7422556Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:51.7423454Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:51.7424272Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:51.7425099Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:51.7426097Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:51.7426859Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:51.7427368Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.7427896Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:51.7428396Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:51.7428729Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:51.7429239Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:51.7429683Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.7430065Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:51.7430361Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:46:51.7431038Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:51.7431748Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:51.7432162Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:51.7432542Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:51.7432919Z #define _PSTL_PRAGMA_MESSAGE(x) 
2025-05-07T19:46:51.7433457Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:51.7433998Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:51.7434389Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:51.7434741Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:51.7435111Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.7435476Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.7435877Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:51.7436310Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:51.7436813Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:51.7437275Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:51.7437702Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:51.7438046Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:51.7438348Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:51.7438638Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:51.7438936Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:46:51.7439344Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:51.7439810Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:51.7440102Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:51.7440428Z #define _PSTL_VERSION 12000 2025-05-07T19:46:51.7440689Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:51.7441070Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:51.7441449Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:51.7441761Z #define _PTRDIFF_T 2025-05-07T19:46:51.7441999Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:51.7442293Z #define _SIGSET_H_types 1 2025-05-07T19:46:51.7442647Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 
2025-05-07T19:46:51.7442986Z #define _SIZE_T 2025-05-07T19:46:51.7443215Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:51.7443457Z #define _STDIO_H 1 2025-05-07T19:46:51.7443702Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:51.7443939Z #define _STDLIB_H 1 2025-05-07T19:46:51.7444180Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:51.7444429Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:51.7444730Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:51.7445000Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:51.7445246Z #define _STL_PAIR_H 1 2025-05-07T19:46:51.7445491Z #define _STL_RELOPS_H 1 2025-05-07T19:46:51.7445730Z #define _STRING_H 1 2025-05-07T19:46:51.7445976Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:51.7446193Z #define _SVID_SOURCE 1 2025-05-07T19:46:51.7446431Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:51.7446660Z #define _SYS_SELECT_H 1 2025-05-07T19:46:51.7446925Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:51.7447174Z #define _SYS_TYPES_H 1 2025-05-07T19:46:51.7447431Z #define _TIME_H 1 2025-05-07T19:46:51.7447642Z #define _VA_LIST_DEFINED 2025-05-07T19:46:51.7447865Z #define _XLOCALE_H 1 2025-05-07T19:46:51.7448102Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:51.7448410Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:51.7448652Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:51.7448916Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:51.7449280Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:51.7449698Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:51.7450087Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:51.7450403Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:51.7450728Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:51.7450985Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:51.7451251Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:51.7451510Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:51.7451761Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:51.7452035Z 
#define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:51.7452299Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:51.7452605Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:51.7452869Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:51.7453147Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:51.7453406Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:51.7453683Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:46:51.7453959Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:51.7454260Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.7454755Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7455090Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7455431Z #define __BOOL_WIDTH__ 8 2025-05-07T19:46:51.7455691Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:51.7456033Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:51.7456354Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:51.7456760Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:46:51.7457310Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:51.7457630Z #define __CHAR_BIT__ 8 2025-05-07T19:46:51.7457896Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.7458257Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.7458609Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.7458946Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.7459283Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.7459581Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.7459895Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.7460204Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.7460527Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.7460845Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:51.7461158Z #define __CLANG_LIMITS_H 2025-05-07T19:46:51.7461422Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:46:51.7461791Z #define __CLOCKID_T_TYPE 
__S32_TYPE 2025-05-07T19:46:51.7462106Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7462417Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:51.7462694Z #define __COMPAR_FN_T 2025-05-07T19:46:51.7462933Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:51.7463209Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:46:51.7463479Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:51.7463754Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:51.7464015Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:51.7464628Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:51.7465247Z #define __CUDACC__ 1 2025-05-07T19:46:51.7465491Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:51.7465789Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:51.7466235Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:51.7466722Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:51.7467010Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:51.7467376Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_##_FEAT 2025-05-07T19:46:51.7467761Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:51.7468025Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:51.7468312Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:51.7468636Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:51.7468918Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:51.7469178Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:51.7469450Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:51.7469724Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:51.7470023Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:46:51.7470536Z #define __DBL_DIG__ 15 2025-05-07T19:46:51.7470806Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:46:51.7471116Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:51.7471373Z #define __DBL_HAS_INFINITY__ 1 
2025-05-07T19:46:51.7471646Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.7471905Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:51.7472165Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:51.7472419Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:51.7472694Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:46:51.7472981Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:51.7473266Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:51.7473526Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:46:51.7473854Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:46:51.7474170Z #define __DELETE_THROW throw() 2025-05-07T19:46:51.7474424Z #define __DEPRECATED 1 2025-05-07T19:46:51.7474693Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7474998Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.7475342Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7475662Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:51.7476011Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7476327Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:51.7476782Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:51.7477116Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:51.7477446Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.7477791Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:51.7478093Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:51.7478411Z #define __ELF__ 1 2025-05-07T19:46:51.7478662Z #define __END_DECLS } 2025-05-07T19:46:51.7478967Z #define __END_NAMESPACE_C99 2025-05-07T19:46:51.7479253Z #define __END_NAMESPACE_STD 2025-05-07T19:46:51.7479558Z #define __EXCEPTIONS 1 2025-05-07T19:46:51.7479823Z #define __EXCEPTION_H 1 2025-05-07T19:46:51.7480128Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:51.7480570Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:51.7481037Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:51.7481585Z #define 
__FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:51.7482191Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:51.7482960Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:51.7483380Z #define __FD_SETSIZE 1024 2025-05-07T19:46:51.7484317Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:51.7485080Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:51.7485373Z #define __FILE_defined 1 2025-05-07T19:46:51.7485678Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:51.7485956Z #define __FLOAT128__ 1 2025-05-07T19:46:51.7486263Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:51.7486585Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:46:51.7486946Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:46:51.7487293Z #define __FLT16_DIG__ 3 2025-05-07T19:46:51.7487598Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:46:51.7487923Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:46:51.7488241Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:46:51.7488564Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.7488860Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:46:51.7489174Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:46:51.7489459Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:46:51.7489767Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:46:51.7490067Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:46:51.7490394Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:46:51.7490690Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:46:51.7491036Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:51.7491341Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:46:51.7491696Z #define __FLT_DIG__ 6 2025-05-07T19:46:51.7491992Z #define 
__FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:46:51.7492316Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:51.7492639Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:51.7492934Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.7493248Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:51.7493527Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:51.7493846Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:51.7494129Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:46:51.7494458Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:51.7494759Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:51.7495081Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:46:51.7495376Z #define __FLT_RADIX__ 2 2025-05-07T19:46:51.7495631Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.7495975Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.7496305Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.7496724Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.7497068Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:51.7497451Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7497754Z #define __FXSR__ 1 2025-05-07T19:46:51.7498593Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:51.7498903Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.7499206Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.7499538Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.7499850Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.7500168Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.7500467Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.7500783Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.7501088Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.7501416Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.7501729Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:51.7502065Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 
2025-05-07T19:46:51.7502386Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:51.7502692Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:51.7503096Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:51.7503432Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:51.7503774Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:51.7504084Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.7504377Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:51.7504675Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:51.7504977Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:51.7505258Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:51.7505525Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:51.7505941Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.7506376Z #define __GLIBC__ 2 2025-05-07T19:46:51.7506614Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:46:51.7506872Z #define __GNUC_MINOR__ 2 2025-05-07T19:46:51.7507139Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:46:51.7507538Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.7507981Z #define __GNUC_VA_LIST 2025-05-07T19:46:51.7508230Z #define __GNUC__ 4 2025-05-07T19:46:51.7508446Z #define __GNUG__ 4 2025-05-07T19:46:51.7508683Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:51.7508935Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:46:51.7509315Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:51.7509573Z #define __GXX_RTTI 1 2025-05-07T19:46:51.7509798Z #define __GXX_WEAK__ 1 2025-05-07T19:46:51.7510012Z #define __HAVE_COLUMN 2025-05-07T19:46:51.7510244Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:51.7510477Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:51.7510730Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.7510980Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.7511263Z #define __INO_T_MATCHES_INO64_T 1 
2025-05-07T19:46:51.7511547Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.7511827Z #define __INT16_C_SUFFIX__ 2025-05-07T19:46:51.7512076Z #define __INT16_FMTd__ "hd" 2025-05-07T19:46:51.7512311Z #define __INT16_FMTi__ "hi" 2025-05-07T19:46:51.7512557Z #define __INT16_MAX__ 32767 2025-05-07T19:46:51.7512793Z #define __INT16_TYPE__ short 2025-05-07T19:46:51.7513042Z #define __INT32_C_SUFFIX__ 2025-05-07T19:46:51.7513270Z #define __INT32_FMTd__ "d" 2025-05-07T19:46:51.7513511Z #define __INT32_FMTi__ "i" 2025-05-07T19:46:51.7513740Z #define __INT32_MAX__ 2147483647 2025-05-07T19:46:51.7513998Z #define __INT32_TYPE__ int 2025-05-07T19:46:51.7514242Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:46:51.7514477Z #define __INT64_FMTd__ "ld" 2025-05-07T19:46:51.7514723Z #define __INT64_FMTi__ "li" 2025-05-07T19:46:51.7514965Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7515254Z #define __INT64_TYPE__ long int 2025-05-07T19:46:51.7515495Z #define __INT8_C_SUFFIX__ 2025-05-07T19:46:51.7515905Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:46:51.7516147Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:46:51.7516400Z #define __INT8_MAX__ 127 2025-05-07T19:46:51.7516644Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:51.7516948Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:46:51.7517293Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:46:51.7517541Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:46:51.7517819Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7518110Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:51.7518387Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:51.7518633Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:46:51.7518898Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:46:51.7519161Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7519467Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:51.7519727Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:51.7519992Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:46:51.7520269Z 
#define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:46:51.7520529Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:46:51.7520626Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:46:51.7520734Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:46:51.7520827Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:46:51.7520973Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:46:51.7521093Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:46:51.7521186Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:46:51.7521278Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:46:51.7521374Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:46:51.7521487Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:46:51.7521605Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7521705Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:51.7521815Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:51.7521908Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:46:51.7522002Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:46:51.7522093Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:46:51.7522211Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:51.7533136Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:51.7533312Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:46:51.7533410Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:46:51.7533526Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:46:51.7533625Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:46:51.7533719Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:51.7533823Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:46:51.7533914Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:46:51.7534008Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:46:51.7534097Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:51.7534202Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:51.7534290Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:46:51.7534378Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:46:51.7534510Z #define __INT_LEAST64_MAX__ 
9223372036854775807L 2025-05-07T19:46:51.7534608Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:51.7534698Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:51.7534784Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:46:51.7534885Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:46:51.7534980Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:46:51.7535081Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:51.7535185Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:51.7535271Z #define __INT_MAX__ 2147483647 2025-05-07T19:46:51.7535355Z #define __INT_WIDTH__ 32 2025-05-07T19:46:51.7535442Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:51.7535547Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:51.7535635Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:51.7535772Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:46:51.7535871Z #define __LDBL_DIG__ 18 2025-05-07T19:46:51.7535990Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:46:51.7536082Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:51.7536180Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:51.7536268Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.7536359Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:51.7536540Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:51.7536643Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:51.7536760Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:46:51.7537170Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:51.7537279Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:51.7537398Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:46:51.7537518Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:51.7537657Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:51.7537843Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:51.7537942Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:51.7538094Z #define 
__LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:51.7538189Z #define __LEAF 2025-05-07T19:46:51.7538274Z #define __LEAF_ATTR 2025-05-07T19:46:51.7538370Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:51.7538463Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:51.7538567Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:46:51.7538657Z #define __LLONG_WIDTH__ 64 2025-05-07T19:46:51.7538829Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:46:51.7538950Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:51.7539052Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7539139Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:51.7539222Z #define __LP64__ 1 2025-05-07T19:46:51.7539561Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:51.7540197Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:51.7540297Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:51.7540407Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7540505Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:51.7540589Z #define __MMX__ 1 2025-05-07T19:46:51.7540701Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:51.7540787Z #define __N(msgid) (msgid) 2025-05-07T19:46:51.7540911Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:51.7541031Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.7541132Z #define __NO_CTYPE 1 2025-05-07T19:46:51.7541219Z #define __NO_INLINE__ 1 2025-05-07T19:46:51.7541310Z #define __NO_MATH_INLINES 1 2025-05-07T19:46:51.7541432Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:51.7541539Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:51.7541621Z #define __NVCC__ 1 2025-05-07T19:46:51.7541719Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:46:51.7541827Z #define 
__NV_LEGACY_LAUNCH 1 2025-05-07T19:46:51.7541927Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:51.7542019Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:46:51.7542131Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.7542229Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:51.7542335Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7542467Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:46:51.7542592Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:46:51.7542707Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:46:51.7542816Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:46:51.7542935Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:46:51.7543035Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:51.7543132Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:51.7543240Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:51.7543330Z #define __P(args) args 2025-05-07T19:46:51.7543423Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:51.7543504Z #define __PIC__ 2 2025-05-07T19:46:51.7543610Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:51.7543693Z #define __PIE__ 2 2025-05-07T19:46:51.7543782Z #define __PMT(args) args 2025-05-07T19:46:51.7543886Z #define __POINTER_WIDTH__ 64 2025-05-07T19:46:51.7543987Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:51.7544088Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:51.7544200Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:51.7544310Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:51.7544481Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:46:51.7544577Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:46:51.7544698Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:46:51.7544798Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:51.7544894Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:51.7545120Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.7545347Z #define 
__REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:51.7545602Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.7545874Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.7546122Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:51.7546219Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:51.7546370Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.7546500Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.7546593Z #define __S16_TYPE short int 2025-05-07T19:46:51.7546678Z #define __S32_TYPE int 2025-05-07T19:46:51.7546769Z #define __S64_TYPE long int 2025-05-07T19:46:51.7546876Z #define __SCHAR_MAX__ 127 2025-05-07T19:46:51.7546960Z #define __SEG_FS 1 2025-05-07T19:46:51.7547037Z #define __SEG_GS 1 2025-05-07T19:46:51.7547140Z #define __SHRT_MAX__ 32767 2025-05-07T19:46:51.7547230Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:51.7547331Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:46:51.7547431Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:51.7547533Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:51.7547625Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:51.7547716Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:51.7547816Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:51.7547901Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:51.7548000Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:51.7548100Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:51.7548207Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:51.7548301Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:51.7548400Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:51.7548524Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:51.7548626Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:51.7548727Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:51.7548826Z #define 
__SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:51.7548945Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:51.7549045Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:51.7549260Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:51.7549370Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:51.7549460Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:51.7549550Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:51.7549639Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:51.7549744Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:51.7549831Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:51.7549920Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:46:51.7550027Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:46:51.7550114Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:46:51.7550199Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:46:51.7550302Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.7550417Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:51.7550499Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:51.7550584Z #define __SLONG32_TYPE int 2025-05-07T19:46:51.7550696Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:51.7550796Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7550893Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.7550985Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:51.7551093Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:51.7551183Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:51.7551271Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:51.7551384Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7551484Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.7551634Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:51.7551721Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:51.7551828Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.7551918Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:51.7552017Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7552128Z #define 
__SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.7552223Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:51.7552313Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:51.7552399Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:51.7552495Z #define __SM_70_RT_H__ 2025-05-07T19:46:51.7552582Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:51.7552664Z #define __SM_80_RT_H__ 2025-05-07T19:46:51.7552765Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:51.7552849Z #define __SM_90_RT_H__ 2025-05-07T19:46:51.7552943Z #define __SQUAD_TYPE long int 2025-05-07T19:46:51.7553026Z #define __SSE2_MATH__ 1 2025-05-07T19:46:51.7553171Z #define __SSE2__ 1 2025-05-07T19:46:51.7553258Z #define __SSE_MATH__ 1 2025-05-07T19:46:51.7553336Z #define __SSE__ 1 2025-05-07T19:46:51.7553448Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:51.7553571Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:46:51.7553682Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:51.7553773Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:51.7553873Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:51.7553970Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:51.7554056Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:51.7554156Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:51.7554247Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:51.7554335Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:51.7554420Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:51.7554510Z #define __STDC__ 1 2025-05-07T19:46:51.7554593Z #define __STDDEF_H 2025-05-07T19:46:51.7554679Z #define __STRING(x) #x 2025-05-07T19:46:51.7554803Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.7554900Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:51.7555025Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7555119Z #define __SWORD_TYPE long int 2025-05-07T19:46:51.7555246Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:51.7555361Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:51.7555453Z 
#define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:51.7555570Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.7555661Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:51.7555749Z #define __THROW throw () 2025-05-07T19:46:51.7555838Z #define __THROWNL throw () 2025-05-07T19:46:51.7555938Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:51.7556046Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.7556144Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:51.7556247Z #define __U32_TYPE unsigned int 2025-05-07T19:46:51.7556344Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:51.7556545Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.7556646Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:46:51.7556730Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:46:51.7556817Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:46:51.7556902Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:46:51.7557001Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:46:51.7557084Z #define __UINT16_MAX__ 65535 2025-05-07T19:46:51.7557180Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:46:51.7557278Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:46:51.7557364Z #define __UINT32_FMTX__ "X" 2025-05-07T19:46:51.7557446Z #define __UINT32_FMTo__ "o" 2025-05-07T19:46:51.7557529Z #define __UINT32_FMTu__ "u" 2025-05-07T19:46:51.7557624Z #define __UINT32_FMTx__ "x" 2025-05-07T19:46:51.7557712Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:46:51.7557803Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:51.7557901Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:46:51.7557983Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:46:51.7558068Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:46:51.7558153Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:46:51.7558302Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:46:51.7558402Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.7558500Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:51.7558600Z #define __UINT8_C_SUFFIX__ 
2025-05-07T19:46:51.7558682Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:46:51.7558764Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:46:51.7558850Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:46:51.7558947Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:46:51.7559028Z #define __UINT8_MAX__ 255 2025-05-07T19:46:51.7559119Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:51.7559220Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:46:51.7559305Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:46:51.7559390Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:46:51.7559477Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:46:51.7559579Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:46:51.7559682Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.7559833Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:51.7559938Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:46:51.7560024Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:46:51.7560108Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:46:51.7560195Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:46:51.7560292Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:46:51.7560394Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.7560494Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:51.7560594Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:46:51.7560684Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:46:51.7560773Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:46:51.7560859Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:46:51.7560961Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:46:51.7561047Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:46:51.7561147Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:46:51.7561247Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:46:51.7561337Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:46:51.7561425Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:46:51.7561511Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:46:51.7561615Z #define __UINT_FAST32_MAX__ 
4294967295U 2025-05-07T19:46:51.7561711Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:46:51.7561798Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:46:51.7561896Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:46:51.7561982Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:46:51.7562063Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:46:51.7562174Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.7562300Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.7562389Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:46:51.7562474Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:46:51.7562571Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:46:51.7562659Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:46:51.7562748Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:46:51.7562859Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:51.7562949Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:46:51.7563037Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:46:51.7563122Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:46:51.7563225Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:46:51.7563315Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:46:51.7563420Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:46:51.7563522Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:46:51.7563611Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:46:51.7563699Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:46:51.7563784Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:46:51.7563887Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:46:51.7563990Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:51.7564081Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:46:51.7564178Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:46:51.7564271Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:46:51.7564412Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:46:51.7564523Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 
2025-05-07T19:46:51.7564647Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.7564736Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:46:51.7564822Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:46:51.7564916Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:46:51.7565004Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:46:51.7565093Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:46:51.7565193Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:51.7565295Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:51.7565399Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:51.7565496Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:51.7565598Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:51.7565691Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:51.7565823Z #define __USE_ANSI 1 2025-05-07T19:46:51.7565911Z #define __USE_ATFILE 1 2025-05-07T19:46:51.7566000Z #define __USE_BSD 1 2025-05-07T19:46:51.7566087Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:51.7566164Z #define __USE_GNU 1 2025-05-07T19:46:51.7566253Z #define __USE_ISOC11 1 2025-05-07T19:46:51.7566331Z #define __USE_ISOC95 1 2025-05-07T19:46:51.7566414Z #define __USE_ISOC99 1 2025-05-07T19:46:51.7566496Z #define __USE_ISOCXX11 1 2025-05-07T19:46:51.7566587Z #define __USE_LARGEFILE 1 2025-05-07T19:46:51.7566672Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:51.7566748Z #define __USE_MISC 1 2025-05-07T19:46:51.7566841Z #define __USE_POSIX 1 2025-05-07T19:46:51.7566929Z #define __USE_POSIX199309 1 2025-05-07T19:46:51.7567011Z #define __USE_POSIX199506 1 2025-05-07T19:46:51.7567090Z #define __USE_POSIX2 1 2025-05-07T19:46:51.7567180Z #define __USE_SVID 1 2025-05-07T19:46:51.7567258Z #define __USE_UNIX98 1 2025-05-07T19:46:51.7567334Z #define __USE_XOPEN 1 2025-05-07T19:46:51.7567421Z #define __USE_XOPEN2K 1 2025-05-07T19:46:51.7567505Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:51.7567595Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:51.7567680Z #define 
__USE_XOPEN2KXSI 1 2025-05-07T19:46:51.7567782Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:51.7567878Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:51.7567973Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:51.7568079Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:51.7568171Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:51.7568263Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:51.7568349Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:51.7568768Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.7568880Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:51.7568970Z #define __WAIT_STATUS void * 2025-05-07T19:46:51.7569072Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:51.7569156Z #define __WALL 0x40000000 2025-05-07T19:46:51.7569248Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:46:51.7569343Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:51.7569428Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:51.7569511Z #define __WCLONE 0x80000000 2025-05-07T19:46:51.7569640Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:51.7569732Z #define __WCOREFLAG 0x80 2025-05-07T19:46:51.7569874Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:51.7570021Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:51.7570311Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:51.7570685Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:51.7570834Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:51.7570928Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:46:51.7571031Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:51.7571121Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:46:51.7571214Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:51.7571469Z #define __WNOTHREAD 0x20000000 
2025-05-07T19:46:51.7571552Z #define __WORDSIZE 64 2025-05-07T19:46:51.7571654Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:51.7571783Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:51.7571909Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:51.7572003Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:51.7572129Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:51.7572252Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:51.7572341Z #define ____FILE_defined 1 2025-05-07T19:46:51.7572436Z #define ____mbstate_t_defined 1 2025-05-07T19:46:51.7572557Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:51.7572757Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:51.7572838Z #define __amd64 1 2025-05-07T19:46:51.7572918Z #define __amd64__ 1 2025-05-07T19:46:51.7573040Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:51.7573212Z #define __attribute_artificial__ 2025-05-07T19:46:51.7573361Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:51.7573555Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.7573760Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:51.7574016Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:51.7574166Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:51.7574345Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:51.7574479Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:51.7574612Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:51.7574858Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:51.7574948Z #define __blkcnt_t_defined 2025-05-07T19:46:51.7575040Z #define __blksize_t_defined 
2025-05-07T19:46:51.7575250Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:51.7575388Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:51.7575468Z #define __bounded 2025-05-07T19:46:51.7576078Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:51.7576638Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.7577114Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.7577386Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:51.7577724Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:51.7578663Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:51.7578787Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:51.7578879Z #define __catch(X) catch(X) 2025-05-07T19:46:51.7578962Z #define __cdecl 2025-05-07T19:46:51.7579049Z #define __clang__ 1 2025-05-07T19:46:51.7579181Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:46:51.7579274Z #define __clang_major__ 16 
2025-05-07T19:46:51.7579362Z #define __clang_minor__ 0 2025-05-07T19:46:51.7579464Z #define __clang_patchlevel__ 6 2025-05-07T19:46:51.7579956Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.7580085Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:46:51.7580179Z #define __clock_t_defined 1 2025-05-07T19:46:51.7580284Z #define __clockid_t_defined 1 2025-05-07T19:46:51.7580477Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:51.7580574Z #define __code_model_small__ 1 2025-05-07T19:46:51.7580697Z #define __constant__ __location__(constant) 2025-05-07T19:46:51.7580788Z #define __cplusplus 201703L 2025-05-07T19:46:51.7580893Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:51.7580994Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:51.7581109Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:51.7581208Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:51.7581305Z #define __cpp_attributes 200809L 2025-05-07T19:46:51.7581467Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:51.7581574Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:51.7581671Z #define __cpp_constexpr 201603L 2025-05-07T19:46:51.7581787Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:46:51.7581888Z #define __cpp_decltype 200707L 2025-05-07T19:46:51.7581988Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:51.7582091Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:51.7582227Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:51.7582329Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:51.7582440Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:51.7582537Z #define __cpp_exceptions 199711L 2025-05-07T19:46:51.7582649Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:51.7582753Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:51.7582870Z #define 
__cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:51.7582979Z #define __cpp_hex_float 201603L 2025-05-07T19:46:51.7583081Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:51.7583198Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:46:51.7583317Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:51.7583422Z #define __cpp_init_captures 201304L 2025-05-07T19:46:51.7583526Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:51.7583629Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:51.7583734Z #define __cpp_lambdas 200907L 2025-05-07T19:46:51.7583847Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:51.7583951Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:51.7584058Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:51.7584159Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:51.7584269Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:51.7584430Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:51.7584537Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:51.7584645Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:51.7584777Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:51.7584892Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:51.7584992Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:51.7585091Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:51.7585196Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:51.7585309Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:51.7585408Z #define __cpp_lib_launder 201606 2025-05-07T19:46:51.7585508Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:51.7585641Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:51.7585766Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:51.7585872Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:51.7586008Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 
2025-05-07T19:46:51.7586159Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:51.7586263Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:51.7586367Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:51.7586578Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:51.7586672Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:51.7586792Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:46:51.7586914Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:51.7587047Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:51.7587162Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:51.7587274Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:51.7587429Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:51.7587522Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:51.7587624Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:51.7587730Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:51.7587830Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:51.7587940Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:51.7588076Z #define __cpp_rtti 199711L 2025-05-07T19:46:51.7588197Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:51.7588295Z #define __cpp_static_assert 201411L 2025-05-07T19:46:51.7588511Z #define __cpp_static_call_operator 202207L 2025-05-07T19:46:51.7588629Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:51.7588829Z #define __cpp_template_auto 201606L 2025-05-07T19:46:51.7588937Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:51.7589036Z #define __cpp_unicode_characters 200704L 2025-05-07T19:46:51.7589142Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:51.7589244Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:51.7589344Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:51.7589450Z #define __cpp_variadic_templates 200704L 
2025-05-07T19:46:51.7589542Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:51.7589645Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:51.7589762Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:51.7589858Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:51.7589973Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:51.7590075Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:51.7590177Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:51.7590267Z #define __cudaCDP2EventRecord 2025-05-07T19:46:51.7590372Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:51.7590499Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:51.7590594Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:51.7590677Z #define __cudaCDP2Free 2025-05-07T19:46:51.7590778Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:51.7590879Z #define __cudaCDP2GetDevice 2025-05-07T19:46:51.7590973Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:51.7591064Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:51.7591171Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:51.7591262Z #define __cudaCDP2GetLastError 2025-05-07T19:46:51.7591363Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:51.7591470Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:51.7591582Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:51.7591674Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:51.7591776Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:51.7591885Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:51.7591974Z #define __cudaCDP2Malloc 2025-05-07T19:46:51.7592067Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:51.7592169Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:51.7592277Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:51.7592375Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:51.7592467Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:51.7592577Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:51.7592668Z #define 
__cudaCDP2Memset2DAsync 2025-05-07T19:46:51.7592766Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:51.7592858Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:51.7592971Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:51.7593064Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:51.7593206Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:51.7593405Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:51.7593630Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:51.7593726Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:51.7593838Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:51.7593947Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:51.7594041Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:51.7594135Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:51.7594249Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:51.7594344Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:51.7594436Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:51.7594537Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:51.7594632Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:51.7594725Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:51.7594920Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:51.7595022Z #define __daddr_t_defined 2025-05-07T19:46:51.7595102Z #define __dev_t_defined 2025-05-07T19:46:51.7595195Z #define __device__ __location__(device) 2025-05-07T19:46:51.7595338Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:51.7595558Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:51.7595773Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:51.7595901Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:46:51.7596038Z #define __exctype(name) extern int name (int) __THROW 
2025-05-07T19:46:51.7596209Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:51.7596287Z #define __export__ 2025-05-07T19:46:51.7596531Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.7596726Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.7596806Z #define __flexarr [] 2025-05-07T19:46:51.7596978Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:51.7597176Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:51.7597264Z #define __fsblkcnt_t_defined 2025-05-07T19:46:51.7597350Z #define __fsfilcnt_t_defined 2025-05-07T19:46:51.7597441Z #define __gid_t_defined 2025-05-07T19:46:51.7597578Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:51.7597721Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:51.7597949Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:51.7598048Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:51.7598154Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:51.7598279Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:51.7598397Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:51.7598735Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:51.7598925Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:51.7599090Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:51.7599193Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:51.7599290Z #define __glibcxx_integral_traps true 2025-05-07T19:46:51.7599591Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? 
(((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:51.7599823Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:51.7600012Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.7600160Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:51.7600348Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.7600501Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:51.7600624Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:51.7600771Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.7600901Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:51.7601036Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:51.7601215Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.7601385Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:51.7601529Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:51.7601640Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:51.7601814Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:51.7602020Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.7602249Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:51.7602637Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.7602762Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:51.7602920Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.7603093Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:51.7603297Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 
2025-05-07T19:46:51.7603409Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:51.7603554Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:51.7603664Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:51.7603802Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:51.7603924Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:51.7604022Z #define __global__ __location__(global) 2025-05-07T19:46:51.7604115Z #define __gnu_linux__ 1 2025-05-07T19:46:51.7604249Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:51.7604354Z #define __have_pthread_attr_t 1 2025-05-07T19:46:51.7604451Z #define __host__ __location__(host) 2025-05-07T19:46:51.7604535Z #define __id_t_defined 2025-05-07T19:46:51.7604628Z #define __import__ 2025-05-07T19:46:51.7604769Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:51.7604857Z #define __ino64_t_defined 2025-05-07T19:46:51.7604947Z #define __ino_t_defined 2025-05-07T19:46:51.7605042Z #define __int8_t_defined 2025-05-07T19:46:51.7605256Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.7605405Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:51.7605722Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:51.7605826Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:51.7605945Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:51.7606114Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:51.7606268Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:51.7606540Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:51.7606682Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:51.7606844Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:51.7607042Z #define 
__isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:51.7607185Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:51.7607341Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:51.7607484Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:51.7607625Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:51.7607770Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:51.7607935Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:51.7608067Z #define __k8 1 2025-05-07T19:46:51.7608150Z #define __k8__ 1 2025-05-07T19:46:51.7608252Z #define __key_t_defined 2025-05-07T19:46:51.7608445Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:51.7608538Z #define __ldiv_t_defined 1 2025-05-07T19:46:51.7608633Z #define __linux 1 2025-05-07T19:46:51.7608713Z #define __linux__ 1 2025-05-07T19:46:51.7608805Z #define __lldiv_t_defined 1 2025-05-07T19:46:51.7608886Z #define __llvm__ 1 2025-05-07T19:46:51.7609000Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:51.7609103Z #define __long_double_t long double 2025-05-07T19:46:51.7609202Z #define __malloc_and_calloc_defined 2025-05-07T19:46:51.7609319Z #define __managed__ __location__(managed) 2025-05-07T19:46:51.7609448Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:51.7609534Z #define __mode_t_defined 2025-05-07T19:46:51.7610133Z #define __need_IOV_MAX 2025-05-07T19:46:51.7610240Z #define __need_clockid_t 2025-05-07T19:46:51.7610329Z #define __nlink_t_defined 2025-05-07T19:46:51.7610451Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:51.7610581Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:51.7610756Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:51.7610860Z #define __nv_pure__ __location__(nv_pure) 
2025-05-07T19:46:51.7610948Z #define __off64_t_defined 2025-05-07T19:46:51.7611048Z #define __off_t_defined 2025-05-07T19:46:51.7611129Z #define __pic__ 2 2025-05-07T19:46:51.7611215Z #define __pid_t_defined 2025-05-07T19:46:51.7611307Z #define __pie__ 2 2025-05-07T19:46:51.7611406Z #define __private_extern__ extern 2025-05-07T19:46:51.7611489Z #define __ptr_t void * 2025-05-07T19:46:51.7611572Z #define __ptrvalue 2025-05-07T19:46:51.7611670Z #define __restrict_arr 2025-05-07T19:46:51.7611804Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:46:51.7611938Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:46:51.7612056Z #define __shared__ __location__(shared) 2025-05-07T19:46:51.7612146Z #define __sigset_t_defined 2025-05-07T19:46:51.7612247Z #define __specialization_static 2025-05-07T19:46:51.7612338Z #define __ssize_t_defined 2025-05-07T19:46:51.7612438Z #define __stub_bdflush 2025-05-07T19:46:51.7612520Z #define __stub_chflags 2025-05-07T19:46:51.7612605Z #define __stub_fattach 2025-05-07T19:46:51.7612705Z #define __stub_fchflags 2025-05-07T19:46:51.7612788Z #define __stub_fdetach 2025-05-07T19:46:51.7612874Z #define __stub_getmsg 2025-05-07T19:46:51.7612959Z #define __stub_gtty 2025-05-07T19:46:51.7613054Z #define __stub_lchmod 2025-05-07T19:46:51.7613139Z #define __stub_putmsg 2025-05-07T19:46:51.7613222Z #define __stub_revoke 2025-05-07T19:46:51.7613324Z #define __stub_setlogin 2025-05-07T19:46:51.7613414Z #define __stub_sigreturn 2025-05-07T19:46:51.7613497Z #define __stub_sstk 2025-05-07T19:46:51.7613582Z #define __stub_stty 2025-05-07T19:46:51.7613692Z #define __suseconds_t_defined 2025-05-07T19:46:51.7613782Z #define __thread__ __thread 2025-05-07T19:46:51.7613883Z #define __throw_exception_again throw 2025-05-07T19:46:51.7613983Z #define __time_t_defined 1 2025-05-07T19:46:51.7614074Z #define __timer_t_defined 1 2025-05-07T19:46:51.7614171Z #define __timespec_defined 1 2025-05-07T19:46:51.7614266Z #define __toascii(c) 
((c) & 0x7f) 2025-05-07T19:46:51.7614398Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:51.7614948Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:51.7615030Z #define __try try 2025-05-07T19:46:51.7615127Z #define __tune_k8__ 1 2025-05-07T19:46:51.7615216Z #define __u_char_defined 2025-05-07T19:46:51.7615490Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.7615639Z #define __uid_t_defined 2025-05-07T19:46:51.7615724Z #define __unbounded 2025-05-07T19:46:51.7615806Z #define __unix 1 2025-05-07T19:46:51.7615889Z #define __unix__ 1 2025-05-07T19:46:51.7615989Z #define __useconds_t_defined 2025-05-07T19:46:51.7616077Z #define __warnattr(msg) 2025-05-07T19:46:51.7616219Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:46:51.7616312Z #define __wur 2025-05-07T19:46:51.7616465Z #define __x86_64 1 2025-05-07T19:46:51.7616553Z #define __x86_64__ 1 2025-05-07T19:46:51.7616726Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:51.7616910Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:51.7617027Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:51.7617376Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.7617843Z #define assert_perror(errnum) (!(errnum) ? 
__ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.7617949Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:51.7618044Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:51.7618149Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:51.7618258Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:51.7618358Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:51.7618454Z #define cudaArrayDefault 0x00 2025-05-07T19:46:51.7618578Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:51.7618674Z #define cudaArrayLayered 0x01 2025-05-07T19:46:51.7618772Z #define cudaArraySparse 0x40 2025-05-07T19:46:51.7618937Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:51.7619047Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:51.7619152Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:51.7619330Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:51.7619522Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:51.7619627Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:51.7619735Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:51.7619859Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:51.7619956Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:51.7620048Z #define cudaDeviceMask 0xff 2025-05-07T19:46:51.7620153Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:51.7620290Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:51.7620394Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:51.7620498Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:51.7620615Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:51.7620717Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:51.7620822Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:51.7620921Z #define cudaEventDefault 0x00 2025-05-07T19:46:51.7621036Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:51.7621141Z #define cudaEventInterprocess 0x04 
2025-05-07T19:46:51.7621247Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:51.7621366Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:51.7621466Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:51.7621571Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:51.7621683Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:51.7621888Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:51.7622075Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:51.7622249Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:51.7622380Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:51.7622530Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:51.7622665Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:51.7622778Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:51.7622882Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:51.7622986Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:51.7623100Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:51.7623271Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:51.7623380Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:51.7623486Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:51.7623607Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:51.7623714Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:51.7623828Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:51.7623934Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:51.7624067Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:51.7624208Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.7624375Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.7624712Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.7625013Z #define cudaKernelNodeAttributeClusterDimension 
cudaLaunchAttributeClusterDimension 2025-05-07T19:46:51.7625610Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:51.7625880Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:51.7626284Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:51.7626553Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.7626864Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:51.7627308Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:51.7627533Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.7627639Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:51.7627752Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:51.7627853Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:51.7627965Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:51.7628081Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:51.7628184Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:51.7628328Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:51.7628440Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:51.7628794Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.7628926Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.7629076Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.7629478Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.7629710Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.7629969Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 
2025-05-07T19:46:51.7630173Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.7630482Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:51.7630581Z #define cudaStreamDefault 0x00 2025-05-07T19:46:51.7630723Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:51.7630964Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:51.7631162Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:51.7631404Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:51.7631603Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:51.7631716Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:51.7631813Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:51.7631949Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:51.7632073Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:51.7632167Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:51.7632285Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:51.7632428Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:51.7632531Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:51.7632630Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:51.7632734Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:51.7632850Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:51.7632947Z #define cudaTextureType1D 0x01 2025-05-07T19:46:51.7633058Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:51.7633152Z #define cudaTextureType2D 0x02 2025-05-07T19:46:51.7633251Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:51.7633352Z #define cudaTextureType3D 0x03 2025-05-07T19:46:51.7633450Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:51.7633566Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:51.7633885Z #define 
cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.7633976Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:51.7634165Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:51.7634256Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:51.7634354Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:51.7634434Z #define htole16(x) (x) 2025-05-07T19:46:51.7634516Z #define htole32(x) (x) 2025-05-07T19:46:51.7634612Z #define htole64(x) (x) 2025-05-07T19:46:51.7634720Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:51.7634825Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:51.7634916Z #define isascii(c) __isascii (c) 2025-05-07T19:46:51.7635201Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:51.7635311Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:51.7635421Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:51.7635548Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:51.7635659Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:51.7635768Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:51.7635882Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:51.7636008Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:51.7636117Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:51.7636388Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:51.7636523Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:51.7636608Z #define le16toh(x) (x) 2025-05-07T19:46:51.7636695Z #define le32toh(x) (x) 2025-05-07T19:46:51.7636784Z #define le64toh(x) (x) 2025-05-07T19:46:51.7636880Z #define linux 1 2025-05-07T19:46:51.7636980Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:51.7637112Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:51.7637278Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:51.7637382Z 
#define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:51.7637496Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:46:51.7637612Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:51.7637699Z #define stderr stderr 2025-05-07T19:46:51.7637782Z #define stdin stdin 2025-05-07T19:46:51.7637869Z #define stdout stdout 2025-05-07T19:46:51.7638373Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.7638916Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.7639011Z #define toascii(c) __toascii (c) 2025-05-07T19:46:51.7639141Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:51.7639224Z #define unix 1 2025-05-07T19:46:51.7639349Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:51.7639480Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:51.7639595Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:51.7639714Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:51.7639889Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:51.7639912Z 2025-05-07T19:46:51.7781521Z 2025-05-07T19:46:51.7782191Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:51.7782211Z 2025-05-07T19:46:53.3700671Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:53.3701671Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:53.3702610Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:53.3703509Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:53.3704487Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:53.3705090Z 2025-05-07T19:46:53.4269355Z 2025-05-07T19:46:53.4279222Z which: no nvidia-smi in 
(CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:53.4279962Z [CHECK] nvidia-smi not found 2025-05-07T19:46:53.4280304Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:53.4373409Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:53.4374109Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:53.4374758Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:53.4375121Z env: 2025-05-07T19:46:53.4375400Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:53.4375731Z BUILD_ENV: build_binary 2025-05-07T19:46:53.4376040Z BUILD_TARGET: default 2025-05-07T19:46:53.4376296Z BUILD_VARIANT: cuda 2025-05-07T19:46:53.4376676Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:53.4376948Z ##[endgroup] 2025-05-07T19:46:53.9202773Z ################################################################################ 2025-05-07T19:46:53.9203273Z # Install PyTorch (PIP) 2025-05-07T19:46:53.9203579Z # 2025-05-07T19:46:53.9223292Z # [2025-05-07T19:46:53.921Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:53.9223943Z ################################################################################ 2025-05-07T19:46:53.9224190Z 2025-05-07T19:46:53.9256904Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:54.8530693Z Channels: 2025-05-07T19:46:54.8530991Z - conda-forge 2025-05-07T19:46:54.8531286Z Platform: linux-64 2025-05-07T19:46:57.9439069Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:59.6493677Z Solving environment: \ | / - done 2025-05-07T19:46:59.9617269Z 2025-05-07T19:46:59.9617671Z ## Package Plan ## 2025-05-07T19:46:59.9617896Z 2025-05-07T19:46:59.9618111Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:59.9618431Z 2025-05-07T19:46:59.9618571Z added / 
updated specs: 2025-05-07T19:46:59.9618843Z - numpy 2025-05-07T19:46:59.9619008Z 2025-05-07T19:46:59.9619013Z 2025-05-07T19:46:59.9619149Z The following packages will be downloaded: 2025-05-07T19:46:59.9619381Z 2025-05-07T19:46:59.9619545Z package | build 2025-05-07T19:46:59.9619932Z ---------------------------|----------------- 2025-05-07T19:46:59.9620416Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:59.9620905Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:59.9621430Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:59.9621902Z numpy-2.2.5 | py313h17eae1a_0 8.1 MB conda-forge 2025-05-07T19:46:59.9622353Z ------------------------------------------------------------ 2025-05-07T19:46:59.9622738Z Total: 8.2 MB 2025-05-07T19:46:59.9622974Z 2025-05-07T19:46:59.9623118Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:59.9623345Z 2025-05-07T19:46:59.9623570Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:59.9624107Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:59.9624956Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:59.9625451Z numpy conda-forge/linux-64::numpy-2.2.5-py313h17eae1a_0 2025-05-07T19:46:59.9625730Z 2025-05-07T19:46:59.9625751Z 2025-05-07T19:46:59.9625755Z 2025-05-07T19:46:59.9625901Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:47:00.1220063Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:47:00.1639866Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:47:00.1783790Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:47:00.6865037Z numpy-2.2.5 | 8.1 MB | ########## | 100%
2025-05-07T19:47:00.6869722Z  2025-05-07T19:47:00.6869937Z 2025-05-07T19:47:00.6869941Z 2025-05-07T19:47:00.6869981Z 2025-05-07T19:47:00.6870523Z  done 2025-05-07T19:47:00.7882066Z Preparing transaction: | done 2025-05-07T19:47:00.8889880Z Verifying transaction: - done 2025-05-07T19:47:00.9902732Z Executing transaction: | done 2025-05-07T19:47:01.0951644Z ################################################################################ 2025-05-07T19:47:01.0952102Z # Install Package From PyTorch PIP: torch 2025-05-07T19:47:01.0952427Z # 2025-05-07T19:47:01.0978538Z # [2025-05-07T19:47:01.097Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:47:01.0979078Z ################################################################################ 2025-05-07T19:47:01.0979306Z 2025-05-07T19:47:01.0998479Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:47:01.1903158Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:47:01.1904260Z ################################################################################ 2025-05-07T19:47:01.1905251Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:47:01.1906501Z # 2025-05-07T19:47:01.1929819Z # [2025-05-07T19:47:01.192Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:47:01.1930294Z ################################################################################ 2025-05-07T19:47:01.1930534Z 2025-05-07T19:47:01.1955689Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:47:01.1976692Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:47:01.1992317Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:47:01.1992919Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:47:01.1997382Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:47:01.2005398Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:47:01.2029262Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.4381336Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:31.4385442Z 2025-05-07T19:48:31.4385671Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.4386096Z Collecting torch 2025-05-07T19:48:31.4386773Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp313-cp313-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:31.4387523Z Collecting filelock (from torch) 2025-05-07T19:48:31.4388074Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:31.4389110Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (4.13.2) 2025-05-07T19:48:31.4390247Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (78.1.1) 2025-05-07T19:48:31.4390917Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:31.4391398Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 
2025-05-07T19:48:31.4392308Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 31.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4392648Z Collecting networkx (from torch) 2025-05-07T19:48:31.4393147Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:31.4393787Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 13.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4394461Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (3.1.6) 2025-05-07T19:48:31.4395113Z Collecting fsspec (from torch) 2025-05-07T19:48:31.4395590Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:31.4396157Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4396844Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:31.4397631Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 54.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4398041Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4398745Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:31.4399549Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 5.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4399929Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:31.4400987Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:31.4401768Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 40.8 MB/s eta 0:00:00 2025-05-07T19:48:31.4402131Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:31.4402814Z Downloading 
https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:31.4403563Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 44.4 MB/s eta 0:00:00 2025-05-07T19:48:31.4403949Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:31.4404716Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:31.4405705Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 60.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4406100Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:31.4406759Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:31.4407532Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 70.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4407906Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:31.4408592Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:31.4409364Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 74.3 MB/s eta 0:00:00 2025-05-07T19:48:31.4409743Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:31.4410456Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:31.4411222Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 75.6 MB/s eta 0:00:00 2025-05-07T19:48:31.4411631Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:31.4412334Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:31.4413100Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 83.7 MB/s eta 0:00:00 
2025-05-07T19:48:31.4413495Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:31.4414182Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:31.4414959Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 75.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4415339Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:31.4416215Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:31.4417293Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4417981Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:31.4418708Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:31.4419534Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:31.4420478Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 57.4 MB/s eta 0:00:00 2025-05-07T19:48:31.4420896Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:31.4421726Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:31.4422607Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:31.4423482Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:31.4424455Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:31.4425048Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 
2025-05-07T19:48:31.4425715Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 2.3 MB/s eta 0:00:00 2025-05-07T19:48:31.4426517Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:31.4427650Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp313-cp313-manylinux_2_28_x86_64.whl (825.4 MB) 2025-05-07T19:48:31.4428517Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.4/825.4 MB 32.6 MB/s eta 0:00:00 2025-05-07T19:48:31.4429497Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:31.4430335Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 26.8 MB/s eta 0:00:00 2025-05-07T19:48:31.4431084Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:31.4431922Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 75.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4432689Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:48:31.4433559Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 75.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4435195Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:31.4436709Z 2025-05-07T19:48:31.4438544Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 
nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:31.4440471Z 2025-05-07T19:48:33.4517504Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:33.4524163Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:36.5640505Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:39.6329476Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:39.6330791Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:42.6876346Z True 2025-05-07T19:48:42.6876932Z True 2025-05-07T19:48:42.6877059Z 2025-05-07T19:48:42.7459808Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:42.7533692Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:42.7534384Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:42.7535043Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:42.7535423Z env: 2025-05-07T19:48:42.7535662Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:42.7536109Z BUILD_ENV: build_binary 2025-05-07T19:48:42.7536555Z BUILD_TARGET: default 2025-05-07T19:48:42.7536794Z BUILD_VARIANT: cuda 2025-05-07T19:48:42.7537159Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:42.7537434Z ##[endgroup] 2025-05-07T19:48:43.2145732Z /github/home/miniconda/bin/conda 2025-05-07T19:48:43.2146164Z ################################################################################ 2025-05-07T19:48:43.2146610Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:43.2147011Z # 2025-05-07T19:48:43.2166343Z # [2025-05-07T19:48:43.215Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:43.2166882Z ################################################################################ 2025-05-07T19:48:43.2167139Z 2025-05-07T19:48:43.2192008Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:43.3075141Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:43.3084690Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:43.3086645Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:43.3087952Z 2025-05-07T19:48:43.3942539Z 2025-05-07T19:48:43.3943591Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:43.3965655Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:48.7523243Z Collecting environment information... 
2025-05-07T19:48:48.7523751Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:48.7524108Z Is debug build: False 2025-05-07T19:48:48.7524423Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:48.7524750Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:48.7524932Z 2025-05-07T19:48:48.7525064Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:48.7525399Z GCC version: Could not collect 2025-05-07T19:48:48.7526010Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:48.7526577Z CMake version: version 4.0.2 2025-05-07T19:48:48.7526889Z Libc version: glibc-2.34 2025-05-07T19:48:48.7527078Z 2025-05-07T19:48:48.7527380Z Python version: 3.13.2 | packaged by conda-forge | (main, Feb 17 2025, 14:10:22) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:48.7528026Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:48.7528472Z Is CUDA available: False 2025-05-07T19:48:48.7528734Z CUDA runtime version: 12.6.85 2025-05-07T19:48:48.7529059Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:48.7529388Z GPU models and configuration: Could not collect 2025-05-07T19:48:48.7529760Z Nvidia driver version: Could not collect 2025-05-07T19:48:48.7530073Z cuDNN version: Could not collect 2025-05-07T19:48:48.7530376Z HIP runtime version: N/A 2025-05-07T19:48:48.7530632Z MIOpen runtime version: N/A 2025-05-07T19:48:48.7530921Z Is XNNPACK available: True 2025-05-07T19:48:48.7534201Z 2025-05-07T19:48:48.7534394Z CPU: 2025-05-07T19:48:48.7534647Z Architecture: x86_64 2025-05-07T19:48:48.7535030Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:48.7535699Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:48.7536236Z Byte Order: Little Endian 2025-05-07T19:48:48.7536784Z CPU(s): 96 2025-05-07T19:48:48.7537112Z On-line CPU(s) list: 0-95 2025-05-07T19:48:48.7537495Z Vendor ID: GenuineIntel 2025-05-07T19:48:48.7538085Z Model name: Intel(R) Xeon(R) 
Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:48.7538525Z CPU family: 6 2025-05-07T19:48:48.7538836Z Model: 85 2025-05-07T19:48:48.7539184Z Thread(s) per core: 2 2025-05-07T19:48:48.7539534Z Core(s) per socket: 24 2025-05-07T19:48:48.7539851Z Socket(s): 2 2025-05-07T19:48:48.7540186Z Stepping: 7 2025-05-07T19:48:48.7540511Z BogoMIPS: 5999.98 2025-05-07T19:48:48.7542980Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:48.7545159Z Hypervisor vendor: KVM 2025-05-07T19:48:48.7545496Z Virtualization type: full 2025-05-07T19:48:48.7545835Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:48.7546521Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:48.7547022Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:48.7547413Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:48.7547736Z NUMA node(s): 2 2025-05-07T19:48:48.7548068Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:48.7548402Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:48.7548878Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:48.7549435Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:48.7550085Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:48.7550790Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:48.7551369Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:48.7552001Z 
Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:48.7552748Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:48.7553122Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:48.7553515Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:48.7553884Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:48.7554463Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:48.7555255Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:48.7555886Z Vulnerability Srbds: Not affected 2025-05-07T19:48:48.7556281Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:48.7556515Z 2025-05-07T19:48:48.7556627Z Versions of relevant libraries: 2025-05-07T19:48:48.7556924Z [pip3] numpy==2.2.5 2025-05-07T19:48:48.7557172Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:48.7557504Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:48.7557911Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:48.7558260Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:48.7558579Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:48.7558908Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:48.7559238Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:48.7559544Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:48.7559970Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:48.7560277Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:48.7560611Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:48.7560901Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:48.7561235Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:48.7561528Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:48.7561873Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:48.7562245Z [conda] cuda-cudart 12.6.77 
h5888daf_0 conda-forge 2025-05-07T19:48:48.7562758Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.7563299Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.7563819Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.7564378Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.7564913Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.7565422Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7565916Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:48.7566402Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:48.7566933Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:48.7567421Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7567921Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:48.7568384Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7568873Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7569378Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.7569857Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:48.7570740Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:48.7571276Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:48.7571869Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7572357Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:48.7572888Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7573419Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.7573922Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:48.7574467Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:48.7574980Z [conda] libcusparse 12.5.4.2 
hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7575523Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:48.7576149Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.7576676Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:48.7577198Z [conda] numpy 2.2.5 py313h17eae1a_0 conda-forge 2025-05-07T19:48:48.7577860Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:48.7578422Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:48.7578955Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.7579523Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.7580178Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:48.7580688Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:48.7581228Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:48.7581757Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:48.7582319Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:48.7582864Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:48.7583424Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:48.7583971Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:48.7584483Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.7585025Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:48.7585530Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:48.7585850Z 2025-05-07T19:48:48.8268155Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:48.8268753Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:48.8269292Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:48.8269610Z env: 2025-05-07T19:48:48.8269836Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:48.8270463Z BUILD_ENV: build_binary 2025-05-07T19:48:48.8270977Z BUILD_TARGET: default 2025-05-07T19:48:48.8271245Z BUILD_VARIANT: cuda 2025-05-07T19:48:48.8271500Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:48.8271751Z ##[endgroup] 2025-05-07T19:48:49.2728415Z ################################################################################ 2025-05-07T19:48:49.2729430Z # Install cuDNN 2025-05-07T19:48:49.2730057Z # 2025-05-07T19:48:49.2742582Z # [2025-05-07T19:48:49.273Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:49.2744214Z ################################################################################ 2025-05-07T19:48:49.2744952Z 2025-05-07T19:48:49.2762752Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:49.3624417Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:49.3625629Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:49.3626738Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:49.3627387Z 2025-05-07T19:48:49.3640471Z 2025-05-07T19:48:49.3641314Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:49.3642056Z 2025-05-07T19:48:49.3654177Z 2025-05-07T19:48:49.3672273Z [INSTALL] Downloading cuDNN to /tmp/tmp.OzARDt4RIc ... 2025-05-07T19:48:49.3694045Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:51.6761938Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:48:51.6762883Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:51.6763390Z 2025-05-07T19:48:51.6787513Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:51.6788662Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:51.6789960Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:56.3552802Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:56.4186407Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:49:04.0292788Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:49:04.2765820Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:49:04.3148827Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:49:04.8644077Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:49:07.0076071Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:49:07.0077607Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:49:07.0078174Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:49:07.0078785Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:49:07.0079359Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:49:07.0079856Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:49:07.0080387Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:49:07.0080837Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:49:07.0081288Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:49:07.0081759Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 
2025-05-07T19:49:07.0085247Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:49:07.0087162Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:49:07.0088562Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:11.5934029Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:11.5935532Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:11.6551292Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:11.6551924Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:18.8379187Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:18.8380009Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:18.8380613Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:18.8381249Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:19.0285797Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:19.0287526Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:19.0288931Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:19.0290193Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:19.0637563Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:19.5897711Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:19.5898280Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:19.5898777Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:19.5899267Z 
cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:19.5899769Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:21.7266451Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:21.7267754Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:21.7268693Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:21.7269170Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:21.7269665Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:21.7270351Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:21.7271383Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:21.7273732Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:21.7274215Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:21.7274673Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:21.7275177Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:21.7275662Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:21.7276153Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:21.7276734Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:21.7277221Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:21.7277651Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:21.7285240Z 2025-05-07T19:49:21.7285945Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:49:21.7287271Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:21.7287968Z 2025-05-07T19:49:21.7302111Z 2025-05-07T19:49:21.7302485Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:21.7303166Z 2025-05-07T19:49:21.7317939Z 2025-05-07T19:49:21.7318951Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:21.7343321Z 2025-05-07T19:49:21.7343396Z 2025-05-07T19:49:21.7345359Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:21.7346484Z 2025-05-07T19:49:22.7028233Z 2025-05-07T19:49:22.7028803Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:22.7029108Z + rm -rf /tmp/tmp.OzARDt4RIc 2025-05-07T19:49:22.7029303Z 2025-05-07T19:49:23.2427099Z 2025-05-07T19:49:23.2443693Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:23.2444750Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:23.2445475Z 2025-05-07T19:49:23.6662743Z 2025-05-07T19:49:23.6663671Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:23.6735974Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:23.6736915Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:23.6737573Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:23.6737962Z env: 2025-05-07T19:49:23.6738219Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:23.6738587Z BUILD_ENV: build_binary 2025-05-07T19:49:23.6738860Z BUILD_TARGET: default 2025-05-07T19:49:23.6739152Z BUILD_VARIANT: cuda 2025-05-07T19:49:23.6739441Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:23.6739716Z ##[endgroup] 2025-05-07T19:49:24.0652461Z ################################################################################ 2025-05-07T19:49:24.0653378Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:24.0653767Z # 2025-05-07T19:49:24.0665313Z # [2025-05-07T19:49:24.066Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:24.0666758Z ################################################################################ 2025-05-07T19:49:24.0667439Z 2025-05-07T19:49:24.0680935Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:24.1711845Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:24.1731271Z [BUILD] Running git submodules update ... 
2025-05-07T19:49:24.1761362Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:24.2067993Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:24.2068526Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:24.2069026Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:24.2069456Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:24.2070433Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:24.2070880Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:24.2071301Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:24.2107313Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:24.2637081Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:24.2659032Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:26.1594843Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:26.1779263Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:26.1867608Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:26.2880433Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:26.2919876Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:26.3000342Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:26.3002592Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:26.3003791Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:26.3004945Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:26.3266387Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:26.3302997Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:26.3374896Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:26.3550940Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:26.3580559Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:26.3653842Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:26.3655110Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:26.3663342Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:26.3859268Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:26.3899761Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:26.4090988Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:26.4127355Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:26.4374696Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:26.4413999Z 
Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:26.4503799Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:26.4507289Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:26.4554141Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:26.4556485Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:26.4605964Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:26.4728862Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:26.4761556Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:26.4826521Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:26.4841952Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:26.4850705Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 
2025-05-07T19:49:26.5133086Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:26.5164234Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:26.5279531Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:26.5370665Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:26.6356559Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 292.2 MB/s eta 0:00:00 2025-05-07T19:49:26.6394626Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:26.6489213Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:26.6555219Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:26.6625085Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:26.6731987Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:26.6819643Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:26.6899374Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:26.8412202Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:27.6646388Z 2025-05-07T19:49:27.6699058Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:49:27.6701947Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:27.8047653Z ################################################################################ 2025-05-07T19:49:27.8048192Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:27.8049725Z # 2025-05-07T19:49:27.8066203Z # [2025-05-07T19:49:27.805Z] + install_triton_pip build_binary 2025-05-07T19:49:27.8067487Z ################################################################################ 2025-05-07T19:49:27.8068169Z 2025-05-07T19:49:27.8068849Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:27.8071166Z ################################################################################ 2025-05-07T19:49:27.8072242Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:27.8072622Z # 2025-05-07T19:49:27.8085221Z # [2025-05-07T19:49:27.807Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:27.8086803Z ################################################################################ 2025-05-07T19:49:27.8087480Z 2025-05-07T19:49:27.8101558Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:27.9098021Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:49:27.9098484Z ################################################################################ 2025-05-07T19:49:27.9098905Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:27.9099214Z # 2025-05-07T19:49:27.9117903Z # [2025-05-07T19:49:27.911Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:27.9119464Z ################################################################################ 2025-05-07T19:49:27.9120185Z 2025-05-07T19:49:27.9173722Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:27.9197585Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:27.9199170Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:27.9203327Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:27.9214021Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:27.9242311Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:33.3271029Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:33.3275791Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:33.3277742Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. 
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:33.3279084Z 2025-05-07T19:49:33.3279286Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:33.3279734Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:33.3280533Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:33.3281779Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:33.3283036Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 155.4 MB/s eta 0:00:00 2025-05-07T19:49:33.3283430Z Installing collected packages: pytorch-triton 2025-05-07T19:49:33.3283813Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:33.3284208Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:33.3284679Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:33.3285101Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:33.3285578Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:33.3285839Z 2025-05-07T19:49:35.2055184Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:35.2056659Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:37.0218783Z ################################################################################ 2025-05-07T19:49:37.0220036Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:37.0221250Z ################################################################################ 2025-05-07T19:49:37.0221924Z 2025-05-07T19:49:38.7707528Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:40.6293377Z [CHECK] Python (sub-)package 'skbuild' found ... 
2025-05-07T19:49:40.6294332Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:40.6372096Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:40.6372831Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:40.6373433Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:40.6373770Z env: 2025-05-07T19:49:40.6373997Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:40.6374349Z BUILD_ENV: build_binary 2025-05-07T19:49:40.6374593Z BUILD_TARGET: default 2025-05-07T19:49:40.6374840Z BUILD_VARIANT: cuda 2025-05-07T19:49:40.6375089Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:40.6375337Z ##[endgroup] 2025-05-07T19:49:41.1033756Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:41.1034220Z [BUILD] Extracted build target: default 2025-05-07T19:49:41.1034605Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:42.7211774Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:42.7212093Z 2025-05-07T19:49:42.7791404Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:44.3697305Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:44.3698109Z 2025-05-07T19:49:44.4265368Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:46.0497724Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:46.0498505Z 2025-05-07T19:49:46.1063751Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:47.6978107Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:47.6978889Z 2025-05-07T19:49:47.7565304Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:49.4113940Z [BUILD] Extracted and set Python tag: py313 2025-05-07T19:49:49.4114593Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:49.4346367Z core = 24 2025-05-07T19:49:49.4562111Z sockets = 2 2025-05-07T19:49:49.4562994Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:49.4564070Z 
[CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:49.4564880Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:49.4565765Z + rm -rf dist 2025-05-07T19:49:49.4566129Z 2025-05-07T19:49:49.4576029Z 2025-05-07T19:49:49.4577174Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:49.4577559Z 2025-05-07T19:49:52.3531930Z INFO:root:running clean 2025-05-07T19:49:52.3532718Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:52.3533717Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:52.3534775Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:52.3535222Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:52.3535763Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:52.3536424Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:52.3537191Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:52.3537591Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:52.3538827Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:52.7196636Z 2025-05-07T19:49:52.7197163Z [BUILD] Printing git status ... 2025-05-07T19:49:52.7197515Z + git status 2025-05-07T19:49:52.7197646Z 2025-05-07T19:49:53.3480612Z HEAD detached at pull/4066/merge 2025-05-07T19:49:53.3481535Z Untracked files: 2025-05-07T19:49:53.3482451Z (use "git add <file>..." 
to include in what will be committed) 2025-05-07T19:49:53.3483513Z ../build_only/ 2025-05-07T19:49:53.3484121Z ../collect_env.py 2025-05-07T19:49:53.3484823Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:53.3485323Z 2025-05-07T19:49:53.3486406Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:53.3486748Z 2025-05-07T19:49:53.3486838Z + git diff 2025-05-07T19:49:53.3486961Z 2025-05-07T19:49:53.3769291Z 2025-05-07T19:49:53.3769799Z ################################################################################ 2025-05-07T19:49:53.3771347Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:53.3786315Z # 2025-05-07T19:49:53.3787186Z # [2025-05-07T19:49:53.378Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:53.3788339Z ################################################################################ 2025-05-07T19:49:53.3789029Z 2025-05-07T19:49:53.3791764Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:53.3793102Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:49:54.9991749Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:54.9992040Z 2025-05-07T19:49:55.0600844Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:56.6533731Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:56.6534345Z 2025-05-07T19:49:56.7113179Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:58.3019837Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:58.3020137Z 2025-05-07T19:49:58.3595160Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:59.9532415Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:59.9532804Z 2025-05-07T19:50:00.0118468Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:50:01.6626646Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:50:01.6628215Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:50:01.6629132Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:50:01.6630083Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:50:01.6631111Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:50:01.6632229Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:50:01.6633302Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:50:03.3104335Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:06.6750390Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:06.6751559Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:06.6752407Z 2025-05-07T19:50:07.0883697Z 2025-05-07T19:50:07.0884018Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:08.7330742Z [BUILD] Looking up CUDA version ... 
2025-05-07T19:50:12.0301507Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:12.0302336Z 2025-05-07T19:50:13.6988364Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:13.6990876Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:13.6992244Z 2025-05-07T19:50:13.6992581Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:13.6994560Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:13.6995312Z 2025-05-07T19:50:14.1127367Z 2025-05-07T19:50:14.1128344Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:14.1129681Z 2025-05-07T19:50:15.7022001Z -std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:15.7023113Z 2025-05-07T19:50:15.7598595Z 2025-05-07T19:50:15.7599321Z [BUILD] Setting CUDA build args ... 
2025-05-07T19:50:15.7600316Z + conda run -n build_binary c++ --version 2025-05-07T19:50:15.7600942Z 2025-05-07T19:50:17.3714883Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:17.3715872Z Target: x86_64-conda-linux-gnu 2025-05-07T19:50:17.3716177Z Thread model: posix 2025-05-07T19:50:17.3716488Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:50:17.3717124Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:17.3717566Z 2025-05-07T19:50:17.4304761Z 2025-05-07T19:50:17.4305738Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:17.4306118Z 2025-05-07T19:50:19.0925857Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:19.0926802Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:19.0927277Z 2025-05-07T19:50:19.0927515Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:50:20.7361778Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:20.7362347Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:50:20.7364861Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:50:20.7367391Z ################################################################################ 2025-05-07T19:50:20.7367715Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:20.7368004Z # 2025-05-07T19:50:20.7390403Z # [2025-05-07T19:50:20.738Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:20.7391243Z ################################################################################ 2025-05-07T19:50:20.7391711Z 2025-05-07T19:50:20.7392048Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:50:20.7397188Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py313 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:20.7401867Z 2025-05-07T19:50:22.3871652Z * Getting build dependencies for wheel... 
2025-05-07T19:50:23.6889307Z INFO:root:running egg_info 2025-05-07T19:50:23.6927670Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:23.6928979Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:23.6930549Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:23.6932387Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:23.6933342Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:23.6934235Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.7000442Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.7010271Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.7011728Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:23.7014776Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:23.7018076Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:23.7019496Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:23.7021142Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:23.7022958Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:23.7023551Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:23.7023966Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:50:23.7025225Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:23.9798514Z * Building wheel... 2025-05-07T19:50:25.2779031Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-dn9uxucd', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:25.2783869Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:50:25.2786828Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-dn9uxucd', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', 
'-DCMAKE_CXX_STANDARD=20', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:25.2788494Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:25.2789055Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:25.2789875Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:25.2790438Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:25.2790865Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:25.2796566Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', 
'-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:25.2802187Z 2025-05-07T19:50:25.2802191Z 2025-05-07T19:50:25.2802363Z -------------------------------------------------------------------------------- 2025-05-07T19:50:25.2802762Z -- Trying 'Ninja' generator 2025-05-07T19:50:25.2803033Z -------------------------------- 2025-05-07T19:50:25.2803326Z --------------------------- 2025-05-07T19:50:25.2803570Z ---------------------- 2025-05-07T19:50:25.2803831Z ----------------- 2025-05-07T19:50:25.2804047Z ------------ 2025-05-07T19:50:25.2804284Z ------- 2025-05-07T19:50:25.2804488Z -- 2025-05-07T19:50:25.3248969Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:25.3249648Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:25.3250072Z CMake. 2025-05-07T19:50:25.3250196Z 2025-05-07T19:50:25.3250499Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:50:25.3251066Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:50:25.3251587Z to work with policies introduced by <max> or earlier. 2025-05-07T19:50:25.3251849Z 2025-05-07T19:50:25.3251855Z 2025-05-07T19:50:25.3252093Z Not searching for unused variables given on the command line. 
2025-05-07T19:50:25.4067774Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:25.4179656Z -- Detecting C compiler ABI info 2025-05-07T19:50:25.5413363Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:25.5539107Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:25.5540636Z -- Detecting C compile features 2025-05-07T19:50:25.5639614Z -- Detecting C compile features - done 2025-05-07T19:50:25.6943181Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:25.7025815Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:25.8524887Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:25.8652136Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:25.8652818Z -- Detecting CXX compile features 2025-05-07T19:50:25.8660023Z -- Detecting CXX compile features - done 2025-05-07T19:50:25.8673337Z -- Configuring done (0.6s) 2025-05-07T19:50:25.8734360Z -- Generating done (0.0s) 2025-05-07T19:50:25.8745812Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:25.8783373Z -- 2025-05-07T19:50:25.8783635Z ------- 2025-05-07T19:50:25.8783862Z ------------ 2025-05-07T19:50:25.8784096Z ----------------- 2025-05-07T19:50:25.8784335Z ---------------------- 2025-05-07T19:50:25.8784603Z --------------------------- 2025-05-07T19:50:25.8784861Z -------------------------------- 2025-05-07T19:50:25.8785167Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:25.8785844Z -------------------------------------------------------------------------------- 2025-05-07T19:50:25.8786141Z 2025-05-07T19:50:25.8796853Z Configuring Project 2025-05-07T19:50:25.8797130Z Working directory: 2025-05-07T19:50:25.8797512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:50:25.8797930Z Command: 2025-05-07T19:50:25.8811007Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install -DPYTHON_VERSION_STRING:STRING=3.13.2 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.13.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 
-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_BUILD_TYPE:STRING=Release 2025-05-07T19:50:25.8824574Z 2025-05-07T19:50:25.9227581Z 2025-05-07T19:50:25.9227599Z 2025-05-07T19:50:25.9228095Z ================================================================================ 2025-05-07T19:50:25.9229142Z Default C compiler flags 2025-05-07T19:50:25.9230162Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:25.9231053Z 2025-05-07T19:50:25.9234022Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:25.9235067Z 
================================================================================ 2025-05-07T19:50:25.9235295Z 2025-05-07T19:50:25.9235299Z 2025-05-07T19:50:25.9235303Z 2025-05-07T19:50:25.9235416Z ================================================================================ 2025-05-07T19:50:25.9235786Z Default C++ compiler flags 2025-05-07T19:50:25.9236153Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:25.9236442Z 2025-05-07T19:50:25.9237421Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:25.9238563Z ================================================================================ 2025-05-07T19:50:25.9238793Z 2025-05-07T19:50:25.9238797Z 2025-05-07T19:50:25.9238801Z 2025-05-07T19:50:25.9238934Z ================================================================================ 2025-05-07T19:50:25.9239238Z AVX2_FLAGS: 2025-05-07T19:50:25.9239357Z 2025-05-07T19:50:25.9239459Z -mavx2 2025-05-07T19:50:25.9239647Z -mf16c 2025-05-07T19:50:25.9239850Z -mfma 2025-05-07T19:50:25.9240142Z Not searching for unused variables given on the command line. 
2025-05-07T19:50:25.9240555Z -fopenmp 2025-05-07T19:50:25.9240799Z ================================================================================ 2025-05-07T19:50:25.9241022Z 2025-05-07T19:50:25.9241026Z 2025-05-07T19:50:25.9241029Z 2025-05-07T19:50:25.9241148Z ================================================================================ 2025-05-07T19:50:25.9241472Z AVX512_FLAGS: 2025-05-07T19:50:25.9241598Z 2025-05-07T19:50:25.9241679Z -mavx2 2025-05-07T19:50:25.9241890Z -mf16c 2025-05-07T19:50:25.9242075Z -mfma 2025-05-07T19:50:25.9242283Z -mavx512f 2025-05-07T19:50:25.9242502Z -mavx512bw 2025-05-07T19:50:25.9242701Z -mavx512dq 2025-05-07T19:50:25.9242912Z -mavx512vl 2025-05-07T19:50:25.9243104Z -fopenmp 2025-05-07T19:50:25.9243344Z ================================================================================ 2025-05-07T19:50:25.9243566Z 2025-05-07T19:50:25.9243570Z 2025-05-07T19:50:25.9243573Z 2025-05-07T19:50:25.9243688Z ================================================================================ 2025-05-07T19:50:25.9244047Z The project is built using scikit-build 2025-05-07T19:50:25.9244364Z ================================================================================ 2025-05-07T19:50:25.9244699Z 2025-05-07T19:50:25.9244703Z 2025-05-07T19:50:25.9244708Z 2025-05-07T19:50:25.9244813Z ================================================================================ 2025-05-07T19:50:25.9245125Z Build Settings 2025-05-07T19:50:25.9245249Z 2025-05-07T19:50:25.9245348Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:25.9245643Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:25.9245807Z 2025-05-07T19:50:25.9245898Z NVCC_VERBOSE : 2025-05-07T19:50:25.9246157Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:25.9246415Z CUDNN_LIBRARY : 2025-05-07T19:50:25.9246809Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.9247269Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:25.9247628Z 8.0 2025-05-07T19:50:25.9247825Z 9.0 
2025-05-07T19:50:25.9247997Z 9.0a 2025-05-07T19:50:25.9248114Z 2025-05-07T19:50:25.9248201Z HIP_ROOT_DIR : 2025-05-07T19:50:25.9248438Z HIPCC_VERBOSE : 2025-05-07T19:50:25.9248691Z AMDGPU_TARGETS : 2025-05-07T19:50:25.9248932Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:25.9249200Z ================================================================================ 2025-05-07T19:50:25.9249412Z 2025-05-07T19:50:26.0678531Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:26.1375395Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:27.2042566Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler Clang 16.0.6 2025-05-07T19:50:27.2150063Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:27.3632838Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:27.3761158Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:27.3761737Z -- Detecting CXX compile features 2025-05-07T19:50:27.3769913Z -- Detecting CXX compile features - done 2025-05-07T19:50:27.3847079Z -- Detecting C compiler ABI info 2025-05-07T19:50:27.5075171Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:27.5197696Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:27.5199221Z -- Detecting C compile features 2025-05-07T19:50:27.5201320Z -- Detecting C compile features - done 2025-05-07T19:50:27.5251529Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:28.5474060Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:28.6012209Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:28.6042550Z -- Detecting CUDA compile features 2025-05-07T19:50:28.6044444Z -- Detecting CUDA compile features - done 2025-05-07T19:50:28.6066126Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:28.8938870Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:28.8939874Z -- Performing 
Test C_HAS_AVX_2 2025-05-07T19:50:29.2220084Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:29.2221070Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:29.5090565Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:29.5091532Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:29.8349763Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:29.8350778Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:30.1219014Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:30.1220020Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:30.4545631Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:30.4546049Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:30.7412342Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:30.7413354Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:31.0700783Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:31.0701777Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:31.3538708Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:31.3539762Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:31.6835116Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:31.6835942Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:31.9700508Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:31.9701861Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:32.3007226Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:32.3176001Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:32.3211431Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:32.3272144Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:32.4526613Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:50:32.4534123Z -- Found Threads: TRUE 2025-05-07T19:50:32.4545342Z CMake Warning at 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/FindCUDAToolkit.cmake:957 (message): 2025-05-07T19:50:32.4546514Z Could not find librt library, needed by CUDA::cudart_static 2025-05-07T19:50:32.4546918Z Call Stack (most recent call first): 2025-05-07T19:50:32.4547621Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:59 (find_package) 2025-05-07T19:50:32.4548723Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:32.4549920Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:32.4550774Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.4551236Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.4551420Z 2025-05-07T19:50:32.4551424Z 2025-05-07T19:50:32.5809579Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:32.5811139Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:32.5813268Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:32.7550065Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:32.8562644Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.13.2") found components: Interpreter 2025-05-07T19:50:32.8573265Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:32.8573832Z -- USE_CUSPARSELT is set to 0. 
Compiling without cuSPARSELt support 2025-05-07T19:50:32.8574721Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:32.8575499Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:32.8575853Z Call Stack (most recent call first): 2025-05-07T19:50:32.8576625Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:32.8577833Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:32.8578663Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.8579123Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.8579303Z 2025-05-07T19:50:32.8579308Z 2025-05-07T19:50:32.8579482Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:32.8579909Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:32.8580743Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:32.8907712Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:32.8910105Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:50:32.8911169Z Call Stack (most recent call first): 2025-05-07T19:50:32.8913399Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:32.8915593Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.8916029Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.8916226Z 2025-05-07T19:50:32.8916231Z 2025-05-07T19:50:32.8916624Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:32.8917132Z 2025-05-07T19:50:32.8917136Z 2025-05-07T19:50:32.8917254Z ================================================================================ 2025-05-07T19:50:32.8917590Z PyTorch Flags: 2025-05-07T19:50:32.8917814Z 2025-05-07T19:50:32.8918053Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:32.8918469Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:32.8919513Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.8920080Z 2025-05-07T19:50:32.8920292Z TORCH_LIBRARIES: 2025-05-07T19:50:32.8920504Z torch 2025-05-07T19:50:32.8920718Z torch_library 2025-05-07T19:50:32.8921151Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.8921832Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.8922628Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.8923136Z 2025-05-07T19:50:32.8923345Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:32.8923585Z --expt-relaxed-constexpr 2025-05-07T19:50:32.8923867Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:32.8924149Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:32.8924574Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:32.8924860Z 
================================================================================ 2025-05-07T19:50:32.8925101Z 2025-05-07T19:50:32.8925124Z 2025-05-07T19:50:32.8925129Z 2025-05-07T19:50:32.8925246Z ================================================================================ 2025-05-07T19:50:32.8925564Z NCCL Flags 2025-05-07T19:50:32.8925683Z 2025-05-07T19:50:32.8926048Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.8926929Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.8927553Z ================================================================================ 2025-05-07T19:50:32.8927772Z 2025-05-07T19:50:32.8927776Z 2025-05-07T19:50:32.8927780Z 2025-05-07T19:50:32.8927890Z ================================================================================ 2025-05-07T19:50:32.8928210Z CUDA Driver Path 2025-05-07T19:50:32.8928348Z 2025-05-07T19:50:32.8928688Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:32.8929257Z ================================================================================ 2025-05-07T19:50:32.8929473Z 2025-05-07T19:50:32.8929771Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.8944510Z 2025-05-07T19:50:32.8944531Z 2025-05-07T19:50:32.8945203Z ================================================================================ 2025-05-07T19:50:32.8946348Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:32.8947189Z 2025-05-07T19:50:32.8947695Z CPU_SRCS: 2025-05-07T19:50:32.8948018Z 2025-05-07T19:50:32.8948348Z 2025-05-07T19:50:32.8948833Z GPU_SRCS: 2025-05-07T19:50:32.8949171Z 2025-05-07T19:50:32.8949383Z 2025-05-07T19:50:32.8949918Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:32.8950322Z 2025-05-07T19:50:32.8950530Z 2025-05-07T19:50:32.8951065Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:50:32.8951472Z 2025-05-07T19:50:32.8951674Z 2025-05-07T19:50:32.8952181Z OTHER_SRCS: 2025-05-07T19:50:32.8953247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:32.8955008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:32.8955946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:32.8956569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:32.8957305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:32.8958061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:32.8958640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:32.8959212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:32.8959806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:32.8960581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:32.8961176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:32.8961790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:32.8962368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:32.8962953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:32.8963656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:32.8964257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:32.8964858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:32.8965434Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:32.8966029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:32.8966603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:32.8967194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:32.8967774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:32.8968392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:32.8969013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:32.8969605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:32.8970429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:32.8971022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:32.8971643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:32.8972224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:32.8972777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:32.8973377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:32.8973976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:32.8974564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:32.8975128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:32.8975700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:32.8976390Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:32.8976967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:32.8977548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:32.8978105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:32.8978683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:32.8979238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:32.8979805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:32.8980376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:32.8980927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:32.8981494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:32.8982065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:32.8982654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:32.8983435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:32.8984017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:32.8984630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:32.8985214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:32.8985813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:32.8986489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:32.8987102Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:32.8987690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:32.8988256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:32.8988964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:32.8989490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:32.8990040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:32.8990426Z 2025-05-07T19:50:32.8990622Z CC_FLAGS: 2025-05-07T19:50:32.8990733Z 2025-05-07T19:50:32.8990824Z 2025-05-07T19:50:32.8990995Z NVCC_FLAGS: 2025-05-07T19:50:32.8991108Z 2025-05-07T19:50:32.8991200Z 2025-05-07T19:50:32.8991375Z HIPCC_FLAGS: 2025-05-07T19:50:32.8991510Z 2025-05-07T19:50:32.8991585Z 2025-05-07T19:50:32.8991761Z INCLUDE_DIRS: 2025-05-07T19:50:32.8992002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.8992295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:32.8992571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:32.8992860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.8993332Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:32.8994277Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.8994896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:32.8995351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:32.8995790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:32.8996253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:32.8996961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:32.8997413Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:32.8997984Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.8998478Z 2025-05-07T19:50:32.8998695Z Selected Source Files: 2025-05-07T19:50:32.8998851Z 2025-05-07T19:50:32.8998960Z 2025-05-07T19:50:32.8999173Z HIPified Source Files: 2025-05-07T19:50:32.8999357Z 2025-05-07T19:50:32.8999442Z 2025-05-07T19:50:32.8999641Z Library Dependencies: 2025-05-07T19:50:32.8999893Z torch 2025-05-07T19:50:32.9000087Z torch_library 2025-05-07T19:50:32.9000533Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.9001215Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.9001897Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.9002703Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.9003420Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:32.9004032Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.9004433Z 2025-05-07T19:50:32.9004641Z Output Library: 2025-05-07T19:50:32.9004856Z asmjit 2025-05-07T19:50:32.9005150Z 2025-05-07T19:50:32.9005371Z Destination Directory: 2025-05-07T19:50:32.9005615Z fbgemm_gpu 2025-05-07T19:50:32.9005875Z ================================================================================ 2025-05-07T19:50:32.9006101Z 2025-05-07T19:50:32.9006105Z 2025-05-07T19:50:32.9006109Z 2025-05-07T19:50:32.9006225Z ================================================================================ 2025-05-07T19:50:32.9006576Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:32.9006863Z 2025-05-07T19:50:32.9007067Z CPU_SRCS: 2025-05-07T19:50:32.9007181Z 2025-05-07T19:50:32.9007396Z 2025-05-07T19:50:32.9007645Z 
GPU_SRCS: 2025-05-07T19:50:32.9007761Z 2025-05-07T19:50:32.9007859Z 2025-05-07T19:50:32.9008050Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:32.9008187Z 2025-05-07T19:50:32.9008285Z 2025-05-07T19:50:32.9008469Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:32.9008750Z 2025-05-07T19:50:32.9008828Z 2025-05-07T19:50:32.9009009Z OTHER_SRCS: 2025-05-07T19:50:32.9009315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:32.9009758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:32.9010239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:32.9010674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:32.9011082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:32.9011579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:32.9012026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:32.9012429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:32.9012819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:32.9013263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:32.9013715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:32.9014133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:32.9014579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:32.9014946Z 2025-05-07T19:50:32.9015163Z CC_FLAGS: 2025-05-07T19:50:32.9015287Z 2025-05-07T19:50:32.9015372Z 2025-05-07T19:50:32.9015587Z NVCC_FLAGS: 2025-05-07T19:50:32.9015708Z 2025-05-07T19:50:32.9015793Z 2025-05-07T19:50:32.9016012Z HIPCC_FLAGS: 2025-05-07T19:50:32.9016141Z 2025-05-07T19:50:32.9016225Z 2025-05-07T19:50:32.9016535Z INCLUDE_DIRS: 2025-05-07T19:50:32.9016999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9017334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:32.9017669Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:32.9018000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9018528Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:32.9019313Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.9019982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:32.9020424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:32.9020856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:32.9021352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:32.9021874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:32.9022358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:32.9022922Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.9023457Z 2025-05-07T19:50:32.9023673Z Selected Source Files: 2025-05-07T19:50:32.9023855Z 2025-05-07T19:50:32.9023942Z 2025-05-07T19:50:32.9024176Z HIPified Source Files: 2025-05-07T19:50:32.9024337Z 2025-05-07T19:50:32.9024422Z 2025-05-07T19:50:32.9024654Z Library Dependencies: 2025-05-07T19:50:32.9024898Z torch 2025-05-07T19:50:32.9025130Z torch_library 2025-05-07T19:50:32.9025573Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.9028243Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.9028940Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.9029980Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.9030726Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:32.9031251Z asmjit 2025-05-07T19:50:32.9031686Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.9032081Z 2025-05-07T19:50:32.9032306Z Output Library: 2025-05-07T19:50:32.9032528Z fbgemm 2025-05-07T19:50:32.9032750Z 2025-05-07T19:50:32.9032955Z Destination Directory: 2025-05-07T19:50:32.9033228Z fbgemm_gpu 2025-05-07T19:50:32.9033471Z ================================================================================ 2025-05-07T19:50:32.9033730Z 2025-05-07T19:50:32.9033739Z 2025-05-07T19:50:32.9033743Z 2025-05-07T19:50:32.9033863Z ================================================================================ 2025-05-07T19:50:32.9034231Z Running code generation script ... 2025-05-07T19:50:32.9034967Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:32.9035737Z ================================================================================ 2025-05-07T19:50:32.9035976Z 2025-05-07T19:50:33.4374761Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:33.4375763Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:33.4376856Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.4377331Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:33.4377836Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4378355Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.4378852Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:33.4379331Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.4379822Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:33.4380331Z 
Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4380941Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.4381410Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:33.4381877Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4382374Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4382859Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4383391Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4383901Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4384384Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4384882Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4385371Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4385902Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4386398Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4386871Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:33.4387288Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:33.4387637Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:33.4388060Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:33.4388769Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4389255Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:33.4389695Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:33.4390175Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:50:33.4390671Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:33.4391135Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4391762Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4392275Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4392782Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4393285Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4393825Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4394310Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.4394712Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:33.4395097Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.4395572Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4395946Z Written: lookup_adagrad.py 2025-05-07T19:50:33.4396260Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:33.4396636Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:33.4397072Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4397523Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.4397973Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:33.4398408Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4398889Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.4399344Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:33.4399776Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.4400222Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:33.4400668Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4401150Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.4401602Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:33.4402079Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4402564Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4403043Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4403562Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4404046Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4404541Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4405017Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4405517Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4406042Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4406532Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4406998Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:33.4407383Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:33.4407742Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:33.4408137Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4408512Z Written: lookup_adam.py 2025-05-07T19:50:33.4408908Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:33.4409299Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4409740Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:50:33.4410176Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4410647Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:33.4411069Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:33.4411593Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4412061Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:33.4412501Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4412993Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4413480Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4413965Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4414447Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4414961Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4415434Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:33.4415823Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:33.4416181Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:33.4416854Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4417256Z Written: lookup_lamb.py 2025-05-07T19:50:33.4417628Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:33.4418071Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4418544Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:33.4419064Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4419599Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:33.4420077Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:33.4420599Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4421118Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:33.4421644Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4422193Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4422769Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4423317Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4423869Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4424447Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4424962Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.4425417Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:33.4425811Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.4426272Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4426685Z Written: lookup_lars_sgd.py 2025-05-07T19:50:33.4426993Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:33.4427431Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4427940Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:33.4428530Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4429242Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:33.4429795Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:33.4430458Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4431024Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:33.4431604Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4432200Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4432821Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4433473Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4434065Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4434679Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5254979Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:33.5255590Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:33.5256115Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:33.5256791Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5257320Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:33.5257729Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:33.5258357Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5258962Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:33.5259607Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5260210Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:33.5260793Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:50:33.5261477Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5262064Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:33.5262620Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.5263229Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5263841Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5264419Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.5265033Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5265635Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5266214Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:33.5266707Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:33.5267174Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:33.5267699Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5268132Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:33.5268525Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:33.5269023Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5269568Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:33.5270070Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.5271056Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:33.5271597Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:50:33.5272138Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5272974Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5273542Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.5274118Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.5274682Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:33.5275214Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:33.5275910Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:33.5276467Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.5277117Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:33.5277605Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:33.5278140Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5278704Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5279233Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.5279770Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.5280279Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:33.5280794Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:33.5281317Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5281877Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5282422Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:50:33.5282939Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.5283501Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5284073Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5284659Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5285239Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5285784Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5286343Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5286887Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5287465Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5288008Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:33.5288543Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.5289108Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5289686Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5290275Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5290836Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5291418Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5291953Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5292537Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:50:33.5293127Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:33.5293719Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:33.5294378Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:33.5294981Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:33.5295581Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:33.5296169Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:33.5297172Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:33.5297776Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:33.5298367Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:33.5298948Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:33.5299553Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:33.5300125Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:33.5300662Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.5301173Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:33.5301605Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5302126Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5302571Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:33.5303088Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:33.5303531Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5304006Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5304444Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:33.5304803Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:33.5305260Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:33.5305730Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5306284Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.5306810Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:33.5307270Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5307800Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5308323Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:33.5308854Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5309440Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:33.5310031Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:33.5310585Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:33.5311174Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5311787Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:33.5312372Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5313045Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:33.5313692Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:33.5314268Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:33.5314926Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5315570Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:33.6301525Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6303712Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.6305292Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:33.6306049Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6306927Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.6307545Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:33.6308157Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.6308750Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:33.6309375Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6310008Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.6310632Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:33.6311268Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6311894Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6312558Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:33.6313220Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6313872Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6314525Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6315158Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6315816Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6316479Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6317138Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6317770Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:33.6318307Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:33.6318825Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:33.6319380Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6319876Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:33.6320304Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:33.6320875Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6321505Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:33.6322091Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:33.6322651Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:33.6323253Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6323873Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:33.6324486Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6325086Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.6325746Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:33.6326220Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.6326766Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6327299Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:33.6327842Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6328426Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.6328847Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:33.6329290Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6329738Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.6330192Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:33.6330621Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.6331068Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:33.6331530Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6331985Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.6332443Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:33.6332887Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:50:33.6333364Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6333835Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.6334348Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6334836Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6335302Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6335783Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6336260Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6337073Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6337597Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6338100Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.6338528Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:33.6338903Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.6339346Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6339729Z Written: lookup_sgd.py 2025-05-07T19:50:33.6340048Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:33.6340429Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:33.6340875Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6341370Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.6341847Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:33.6342291Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.6342764Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6343261Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:50:33.6343727Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6344233Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:33.6344707Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6345212Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:33.6345692Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:33.6346173Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6346770Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:33.6347251Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6347794Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.6348323Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6348835Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6349475Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6350028Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6350491Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:33.6350872Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:33.6351229Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:33.6351625Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6352001Z Written: lookup_none.py 2025-05-07T19:50:33.6352292Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:33.6352684Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6353157Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:33.6353655Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:33.6354181Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:33.6354659Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6355147Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6355617Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:33.6356049Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:33.6356528Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:33.6357027Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:33.6357532Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6358012Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6358492Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:33.6358956Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:33.6359399Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:33.6359840Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:33.6360280Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6360759Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6361225Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6361616Z Written: pt2_arg_utils.h 2025-05-07T19:50:33.6361856Z Written: __init__.py 2025-05-07T19:50:33.6362102Z Written: lookup_args_ssd.py 2025-05-07T19:50:33.6362363Z Written: lookup_args.py 2025-05-07T19:50:33.6449501Z 2025-05-07T19:50:33.6449601Z 2025-05-07T19:50:33.6450179Z ================================================================================ 
2025-05-07T19:50:33.6451278Z Running code generation script ...
2025-05-07T19:50:33.6453231Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource
2025-05-07T19:50:33.6454035Z ================================================================================
2025-05-07T19:50:33.6454374Z
2025-05-07T19:50:33.7517614Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False)
2025-05-07T19:50:33.7518958Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource']
2025-05-07T19:50:33.7519727Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu
2025-05-07T19:50:33.7521167Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu
2025-05-07T19:50:33.7521652Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp
2025-05-07T19:50:33.7522145Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh
2025-05-07T19:50:33.7522747Z Written: split_embedding_optimizer_rowwise_adagrad.py
2025-05-07T19:50:33.7523098Z Written: optimizer_args.py
2025-05-07T19:50:33.7623465Z
2025-05-07T19:50:33.7623473Z
2025-05-07T19:50:33.7623938Z ================================================================================
2025-05-07T19:50:33.7624346Z Running code generation script ...
2025-05-07T19:50:33.7625105Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:33.7625897Z ================================================================================ 2025-05-07T19:50:33.8814619Z 2025-05-07T19:50:33.8815691Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:33.8818480Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:33.8820922Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8822837Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8824735Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8826412Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:33.8827013Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:33.8827629Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:33.8828255Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8828929Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8829592Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8830266Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:33.8830941Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:33.8831602Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:33.8832261Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8832876Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8833515Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8834150Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:33.8834761Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:33.8835388Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:33.8835978Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:33.8836582Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:33.8837196Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:33.8837714Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:33.8838205Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:33.8927387Z 2025-05-07T19:50:33.8927534Z 2025-05-07T19:50:33.8927781Z ================================================================================ 2025-05-07T19:50:33.8928631Z Running code generation script ... 
2025-05-07T19:50:33.8929421Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:33.8930191Z ================================================================================ 2025-05-07T19:50:33.8930461Z 2025-05-07T19:50:34.2301142Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:34.2304040Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:34.2304763Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2305384Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2305842Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2306336Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2306814Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2307271Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2307731Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2308158Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2308629Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2309101Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2309590Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2310057Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2310527Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2311024Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2311496Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:50:34.2312020Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2312496Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2312980Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2313479Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2313950Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2314432Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2314888Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2315372Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2315806Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2316283Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2316762Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2317253Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2317725Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2318164Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2318596Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.2319021Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2319492Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2319923Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.2320356Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2320778Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2321174Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.2321587Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.2322130Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2322606Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2323049Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2323505Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2323950Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.2324357Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.2324874Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.2325307Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.2325773Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.2326218Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.2326669Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.2327119Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.2327576Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2328095Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2328576Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2329088Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2329538Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.2329969Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.2330390Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.2428497Z 2025-05-07T19:50:34.2428609Z 2025-05-07T19:50:34.2429129Z ================================================================================ 2025-05-07T19:50:34.2430225Z Running code 
generation script ... 2025-05-07T19:50:34.2432406Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:34.2433694Z ================================================================================ 2025-05-07T19:50:34.2433932Z 2025-05-07T19:50:34.5072842Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:34.5075260Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:34.5076078Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.5076534Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.5077061Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.5077515Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.5077975Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.5078396Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.5078896Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:34.5079377Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.5079828Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:34.5195280Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:34.5206665Z 2025-05-07T19:50:34.5206682Z 2025-05-07T19:50:34.5207211Z ================================================================================ 2025-05-07T19:50:34.5208500Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:34.5224671Z 2025-05-07T19:50:34.5225136Z CPU_SRCS: 2025-05-07T19:50:34.5225783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:34.5226463Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:34.5227154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:34.5227988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:34.5228612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:34.5229080Z 2025-05-07T19:50:34.5229285Z GPU_SRCS: 2025-05-07T19:50:34.5229634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:34.5230234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:34.5230954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:34.5231583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:34.5232428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:34.5232964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:34.5233547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:34.5234084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:34.5234630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:34.5235237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:34.5235667Z 2025-05-07T19:50:34.5235873Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.5236005Z 2025-05-07T19:50:34.5236079Z 2025-05-07T19:50:34.5236275Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.5236407Z 2025-05-07T19:50:34.5236485Z 2025-05-07T19:50:34.5236676Z OTHER_SRCS: 2025-05-07T19:50:34.5236790Z 2025-05-07T19:50:34.5236863Z 2025-05-07T19:50:34.5237042Z CC_FLAGS: 2025-05-07T19:50:34.5237147Z 
2025-05-07T19:50:34.5237230Z 2025-05-07T19:50:34.5237390Z NVCC_FLAGS: 2025-05-07T19:50:34.5237600Z --expt-relaxed-constexpr 2025-05-07T19:50:34.5237842Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.5238114Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.5238371Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.5238618Z 2025-05-07T19:50:34.5238786Z HIPCC_FLAGS: 2025-05-07T19:50:34.5238915Z 2025-05-07T19:50:34.5238987Z 2025-05-07T19:50:34.5239159Z INCLUDE_DIRS: 2025-05-07T19:50:34.5239385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5239683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.5239939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.5240224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5240676Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.5241402Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.5241986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.5242372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.5242769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.5243216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.5243700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.5244115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.5244627Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.5245076Z 2025-05-07T19:50:34.5245271Z Selected Source Files: 2025-05-07T19:50:34.5245666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:34.5246276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:34.5246880Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:34.5247429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:34.5247994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:34.5248650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:34.5249199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:34.5249769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:34.5250370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:34.5250940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:34.5251535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:34.5252112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:34.5252646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:34.5253188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:34.5253794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:34.5254227Z 2025-05-07T19:50:34.5254434Z HIPified Source Files: 2025-05-07T19:50:34.5254578Z 2025-05-07T19:50:34.5254651Z 2025-05-07T19:50:34.5254854Z Library Dependencies: 2025-05-07T19:50:34.5255066Z torch 2025-05-07T19:50:34.5255259Z torch_library 2025-05-07T19:50:34.5255656Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.5256417Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.5257283Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.5258058Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.5258794Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.5259382Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.5259793Z 2025-05-07T19:50:34.5259977Z Output Library: 2025-05-07T19:50:34.5260215Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.5260433Z 2025-05-07T19:50:34.5260645Z Destination Directory: 2025-05-07T19:50:34.5260879Z fbgemm_gpu 2025-05-07T19:50:34.5261127Z ================================================================================ 2025-05-07T19:50:34.5261351Z 2025-05-07T19:50:34.5774670Z 2025-05-07T19:50:34.5774966Z 2025-05-07T19:50:34.5775205Z ================================================================================ 2025-05-07T19:50:34.5775672Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:34.5776029Z 2025-05-07T19:50:34.5776229Z CPU_SRCS: 2025-05-07T19:50:34.5776669Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:34.5777195Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:34.5777629Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:34.5777997Z 2025-05-07T19:50:34.5778191Z GPU_SRCS: 2025-05-07T19:50:34.5778509Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:34.5778959Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:34.5779524Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5780137Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5780851Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5781437Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5781985Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5782547Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5783108Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5783974Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5784563Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5785162Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:34.5785754Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:34.5786357Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:34.5787025Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5787579Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5788142Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5788692Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5789265Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5789834Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5790358Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:34.5790901Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:34.5791425Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 
2025-05-07T19:50:34.5791804Z 2025-05-07T19:50:34.5791978Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.5792119Z 2025-05-07T19:50:34.5792190Z 2025-05-07T19:50:34.5792363Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.5792498Z 2025-05-07T19:50:34.5792569Z 2025-05-07T19:50:34.5792751Z OTHER_SRCS: 2025-05-07T19:50:34.5792856Z 2025-05-07T19:50:34.5792923Z 2025-05-07T19:50:34.5793098Z CC_FLAGS: 2025-05-07T19:50:34.5793200Z 2025-05-07T19:50:34.5793268Z 2025-05-07T19:50:34.5793428Z NVCC_FLAGS: 2025-05-07T19:50:34.5793626Z --expt-relaxed-constexpr 2025-05-07T19:50:34.5793883Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.5794133Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.5794416Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.5794638Z 2025-05-07T19:50:34.5794816Z HIPCC_FLAGS: 2025-05-07T19:50:34.5794923Z 2025-05-07T19:50:34.5795034Z 2025-05-07T19:50:34.5795201Z INCLUDE_DIRS: 2025-05-07T19:50:34.5795422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5795698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.5795960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.5796256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5796706Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.5797422Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.5797999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.5798381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.5798769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.5799203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.5799673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.5800082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.5800589Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.5801033Z 2025-05-07T19:50:34.5801223Z Selected Source Files: 2025-05-07T19:50:34.5801518Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:34.5801937Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:34.5802333Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:34.5802725Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:34.5803148Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:34.5804824Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5805382Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5805922Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5806467Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5807021Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5807622Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5808206Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5808797Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5809396Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5809978Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:34.5810573Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:34.5811171Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:34.5811745Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5812318Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5812867Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5813420Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5813982Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5814523Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5815059Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:34.5815576Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:34.5816115Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.5816591Z 2025-05-07T19:50:34.5816964Z HIPified Source Files: 2025-05-07T19:50:34.5817113Z 2025-05-07T19:50:34.5817208Z 2025-05-07T19:50:34.5817391Z Library Dependencies: 2025-05-07T19:50:34.5817714Z torch 2025-05-07T19:50:34.5817894Z torch_library 2025-05-07T19:50:34.5818335Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.5818985Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.5819666Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.5820430Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.5821153Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.5821605Z asmjit 2025-05-07T19:50:34.5821779Z fbgemm 2025-05-07T19:50:34.5821975Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.5822199Z fbgemm_gpu_config 
2025-05-07T19:50:34.5822550Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.5823045Z 2025-05-07T19:50:34.5823227Z Output Library: 2025-05-07T19:50:34.5823425Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:34.5823644Z 2025-05-07T19:50:34.5823819Z Destination Directory: 2025-05-07T19:50:34.5824044Z fbgemm_gpu 2025-05-07T19:50:34.5824251Z ================================================================================ 2025-05-07T19:50:34.5824470Z 2025-05-07T19:50:34.8485989Z 2025-05-07T19:50:34.8486154Z 2025-05-07T19:50:34.8486659Z ================================================================================ 2025-05-07T19:50:34.8488252Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:34.8489156Z 2025-05-07T19:50:34.8489666Z CPU_SRCS: 2025-05-07T19:50:34.8490263Z src/config/feature_gates.cpp 2025-05-07T19:50:34.8490944Z 2025-05-07T19:50:34.8491462Z GPU_SRCS: 2025-05-07T19:50:34.8491773Z 2025-05-07T19:50:34.8491975Z 2025-05-07T19:50:34.8492409Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8492547Z 2025-05-07T19:50:34.8492649Z 2025-05-07T19:50:34.8492841Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8492981Z 2025-05-07T19:50:34.8493096Z 2025-05-07T19:50:34.8493275Z OTHER_SRCS: 2025-05-07T19:50:34.8493391Z 2025-05-07T19:50:34.8493589Z 2025-05-07T19:50:34.8493773Z CC_FLAGS: 2025-05-07T19:50:34.8493900Z 2025-05-07T19:50:34.8493976Z 2025-05-07T19:50:34.8494162Z NVCC_FLAGS: 2025-05-07T19:50:34.8494382Z 2025-05-07T19:50:34.8494456Z 2025-05-07T19:50:34.8494629Z HIPCC_FLAGS: 2025-05-07T19:50:34.8494759Z 2025-05-07T19:50:34.8494837Z 2025-05-07T19:50:34.8495018Z INCLUDE_DIRS: 2025-05-07T19:50:34.8495244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8495565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8495839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8496181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8496790Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8497564Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8498190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8498601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8499033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8499489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8500001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8500441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8500995Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8501483Z 2025-05-07T19:50:34.8501681Z Selected Source Files: 2025-05-07T19:50:34.8501937Z src/config/feature_gates.cpp 2025-05-07T19:50:34.8502174Z 2025-05-07T19:50:34.8502384Z HIPified Source Files: 2025-05-07T19:50:34.8502534Z 2025-05-07T19:50:34.8502610Z 2025-05-07T19:50:34.8502808Z Library Dependencies: 2025-05-07T19:50:34.8503029Z torch 2025-05-07T19:50:34.8503226Z torch_library 2025-05-07T19:50:34.8503649Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8504386Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8505051Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8505828Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8506559Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8507133Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8507530Z 
2025-05-07T19:50:34.8507712Z Output Library: 2025-05-07T19:50:34.8507940Z fbgemm_gpu_config 2025-05-07T19:50:34.8508131Z 2025-05-07T19:50:34.8508331Z Destination Directory: 2025-05-07T19:50:34.8508559Z fbgemm_gpu 2025-05-07T19:50:34.8508796Z ================================================================================ 2025-05-07T19:50:34.8509019Z 2025-05-07T19:50:34.8509024Z 2025-05-07T19:50:34.8509033Z 2025-05-07T19:50:34.8509159Z ================================================================================ 2025-05-07T19:50:34.8509519Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:34.8509859Z 2025-05-07T19:50:34.8510143Z CPU_SRCS: 2025-05-07T19:50:34.8510424Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:34.8511022Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:34.8511475Z 2025-05-07T19:50:34.8511666Z GPU_SRCS: 2025-05-07T19:50:34.8511919Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:34.8512368Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:34.8512744Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:34.8513115Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:34.8513496Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:34.8513837Z 2025-05-07T19:50:34.8514023Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8514259Z 2025-05-07T19:50:34.8514333Z 2025-05-07T19:50:34.8514514Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8514670Z 2025-05-07T19:50:34.8514741Z 2025-05-07T19:50:34.8514926Z OTHER_SRCS: 2025-05-07T19:50:34.8515041Z 2025-05-07T19:50:34.8515115Z 2025-05-07T19:50:34.8515300Z CC_FLAGS: 2025-05-07T19:50:34.8515413Z 2025-05-07T19:50:34.8515485Z 2025-05-07T19:50:34.8515681Z NVCC_FLAGS: 2025-05-07T19:50:34.8515799Z 2025-05-07T19:50:34.8515874Z 2025-05-07T19:50:34.8516065Z HIPCC_FLAGS: 2025-05-07T19:50:34.8516182Z 2025-05-07T19:50:34.8516254Z 2025-05-07T19:50:34.8516444Z INCLUDE_DIRS: 
2025-05-07T19:50:34.8516677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8517051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8517321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8517624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8518101Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8518875Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8519500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8519913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8520335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8520790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8521299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8521739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8522286Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8522775Z 2025-05-07T19:50:34.8522979Z Selected Source Files: 2025-05-07T19:50:34.8523340Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:34.8523803Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:34.8524237Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:34.8524633Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:34.8525025Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:34.8525387Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:34.8525776Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:34.8526118Z 2025-05-07T19:50:34.8526305Z HIPified Source Files: 2025-05-07T19:50:34.8526453Z 2025-05-07T19:50:34.8526543Z 2025-05-07T19:50:34.8526735Z 
Library Dependencies: 2025-05-07T19:50:34.8526969Z torch 2025-05-07T19:50:34.8527151Z torch_library 2025-05-07T19:50:34.8527583Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8528244Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8528931Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8529712Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8530525Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8531095Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8531464Z 2025-05-07T19:50:34.8531655Z Output Library: 2025-05-07T19:50:34.8531940Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8532164Z 2025-05-07T19:50:34.8532351Z Destination Directory: 2025-05-07T19:50:34.8532587Z fbgemm_gpu 2025-05-07T19:50:34.8532803Z ================================================================================ 2025-05-07T19:50:34.8533039Z 2025-05-07T19:50:34.8533044Z 2025-05-07T19:50:34.8533048Z 2025-05-07T19:50:34.8533157Z ================================================================================ 2025-05-07T19:50:34.8533563Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:34.8533912Z 2025-05-07T19:50:34.8534165Z CPU_SRCS: 2025-05-07T19:50:34.8534383Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:34.8534671Z 2025-05-07T19:50:34.8534844Z GPU_SRCS: 2025-05-07T19:50:34.8535072Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:34.8535332Z 2025-05-07T19:50:34.8535530Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8535665Z 2025-05-07T19:50:34.8535757Z 2025-05-07T19:50:34.8535944Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8536084Z 2025-05-07T19:50:34.8536179Z 2025-05-07T19:50:34.8536449Z OTHER_SRCS: 
2025-05-07T19:50:34.8536583Z 2025-05-07T19:50:34.8536660Z 2025-05-07T19:50:34.8537018Z CC_FLAGS: 2025-05-07T19:50:34.8537149Z 2025-05-07T19:50:34.8537232Z 2025-05-07T19:50:34.8537415Z NVCC_FLAGS: 2025-05-07T19:50:34.8537631Z 2025-05-07T19:50:34.8537713Z 2025-05-07T19:50:34.8537918Z HIPCC_FLAGS: 2025-05-07T19:50:34.8538050Z 2025-05-07T19:50:34.8538140Z 2025-05-07T19:50:34.8538337Z INCLUDE_DIRS: 2025-05-07T19:50:34.8538568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8538900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8539176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8539496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8539979Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8540761Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8541414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8541827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8542264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8542718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8543250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8543695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8544261Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8544772Z 2025-05-07T19:50:34.8544968Z Selected Source Files: 2025-05-07T19:50:34.8545247Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:34.8545565Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:34.8545865Z 2025-05-07T19:50:34.8546063Z HIPified Source Files: 2025-05-07T19:50:34.8546238Z 2025-05-07T19:50:34.8546317Z 2025-05-07T19:50:34.8546522Z Library Dependencies: 
2025-05-07T19:50:34.8546776Z torch 2025-05-07T19:50:34.8546965Z torch_library 2025-05-07T19:50:34.8547409Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8548085Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8548760Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8549551Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8550268Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8550746Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8551095Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8551497Z 2025-05-07T19:50:34.8551684Z Output Library: 2025-05-07T19:50:34.8551941Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8552210Z 2025-05-07T19:50:34.8552498Z Destination Directory: 2025-05-07T19:50:34.8552752Z fbgemm_gpu 2025-05-07T19:50:34.8552974Z ================================================================================ 2025-05-07T19:50:34.8553200Z 2025-05-07T19:50:34.8553221Z 2025-05-07T19:50:34.8553225Z 2025-05-07T19:50:34.8553340Z ================================================================================ 2025-05-07T19:50:34.8553702Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:34.8554051Z 2025-05-07T19:50:34.8554245Z CPU_SRCS: 2025-05-07T19:50:34.8554557Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:34.8554978Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:34.8555355Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:34.8555656Z 2025-05-07T19:50:34.8555837Z GPU_SRCS: 2025-05-07T19:50:34.8556094Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:34.8556448Z codegen/utils/embedding_bounds_check_v2.cu 
2025-05-07T19:50:34.8556788Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:34.8557082Z 2025-05-07T19:50:34.8557271Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8557404Z 2025-05-07T19:50:34.8557475Z 2025-05-07T19:50:34.8557655Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8557786Z 2025-05-07T19:50:34.8557866Z 2025-05-07T19:50:34.8558041Z OTHER_SRCS: 2025-05-07T19:50:34.8558159Z 2025-05-07T19:50:34.8558237Z 2025-05-07T19:50:34.8558401Z CC_FLAGS: 2025-05-07T19:50:34.8558509Z 2025-05-07T19:50:34.8558592Z 2025-05-07T19:50:34.8558765Z NVCC_FLAGS: 2025-05-07T19:50:34.8559052Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8559325Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8559661Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8559934Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8560191Z 2025-05-07T19:50:34.8560420Z HIPCC_FLAGS: 2025-05-07T19:50:34.8560538Z 2025-05-07T19:50:34.8560609Z 2025-05-07T19:50:34.8560801Z INCLUDE_DIRS: 2025-05-07T19:50:34.8561017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8561338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8561611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8561923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8562394Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8563160Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8563795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8564197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8564612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8565069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8565602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8566036Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8566586Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8567099Z 2025-05-07T19:50:34.8567281Z Selected Source Files: 2025-05-07T19:50:34.8567584Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:34.8568003Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:34.8568411Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:34.8568750Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:34.8569090Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:34.8569421Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:34.8569705Z 2025-05-07T19:50:34.8569890Z HIPified Source Files: 2025-05-07T19:50:34.8570053Z 2025-05-07T19:50:34.8570346Z 2025-05-07T19:50:34.8570553Z Library Dependencies: 2025-05-07T19:50:34.8570776Z torch 2025-05-07T19:50:34.8570965Z torch_library 2025-05-07T19:50:34.8571381Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8572179Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8572848Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8573629Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8574347Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8574793Z fbgemm 2025-05-07T19:50:34.8574989Z fbgemm_gpu_config 2025-05-07T19:50:34.8575423Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8575805Z 2025-05-07T19:50:34.8575978Z Output Library: 2025-05-07T19:50:34.8576191Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8576496Z 2025-05-07T19:50:34.8576695Z Destination Directory: 2025-05-07T19:50:34.8576917Z fbgemm_gpu 
2025-05-07T19:50:34.8577151Z ================================================================================ 2025-05-07T19:50:34.8577375Z 2025-05-07T19:50:34.8577505Z 2025-05-07T19:50:34.8577510Z 2025-05-07T19:50:34.8577632Z ================================================================================ 2025-05-07T19:50:34.8578014Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:34.8578361Z 2025-05-07T19:50:34.8578530Z CPU_SRCS: 2025-05-07T19:50:34.8578646Z 2025-05-07T19:50:34.8578718Z 2025-05-07T19:50:34.8578889Z GPU_SRCS: 2025-05-07T19:50:34.8579140Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:34.8579531Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:34.8579920Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:34.8580245Z 2025-05-07T19:50:34.8580416Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8580549Z 2025-05-07T19:50:34.8580632Z 2025-05-07T19:50:34.8580803Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8580943Z 2025-05-07T19:50:34.8581011Z 2025-05-07T19:50:34.8581174Z OTHER_SRCS: 2025-05-07T19:50:34.8581294Z 2025-05-07T19:50:34.8581366Z 2025-05-07T19:50:34.8581529Z CC_FLAGS: 2025-05-07T19:50:34.8581648Z 2025-05-07T19:50:34.8581714Z 2025-05-07T19:50:34.8581886Z NVCC_FLAGS: 2025-05-07T19:50:34.8582085Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8582340Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8582601Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8582883Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8583120Z 2025-05-07T19:50:34.8583296Z HIPCC_FLAGS: 2025-05-07T19:50:34.8583410Z 2025-05-07T19:50:34.8583478Z 2025-05-07T19:50:34.8583659Z INCLUDE_DIRS: 2025-05-07T19:50:34.8583888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8584199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8584496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8584804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8585305Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8586074Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8586728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8587136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8587576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8588060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8588568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8589128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8589642Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8590118Z 2025-05-07T19:50:34.8590307Z Selected Source Files: 2025-05-07T19:50:34.8590593Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:34.8590960Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:34.8591354Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:34.8591776Z 2025-05-07T19:50:34.8591968Z HIPified Source Files: 2025-05-07T19:50:34.8592112Z 2025-05-07T19:50:34.8592209Z 2025-05-07T19:50:34.8592399Z Library Dependencies: 2025-05-07T19:50:34.8592638Z torch 2025-05-07T19:50:34.8592824Z torch_library 2025-05-07T19:50:34.8593240Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8593860Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8594602Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8595355Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8596037Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8596630Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8596998Z 2025-05-07T19:50:34.8597202Z Output Library: 2025-05-07T19:50:34.8597418Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:34.8597659Z 2025-05-07T19:50:34.8597841Z Destination Directory: 2025-05-07T19:50:34.8598075Z fbgemm_gpu 2025-05-07T19:50:34.8598290Z ================================================================================ 2025-05-07T19:50:34.8598520Z 2025-05-07T19:50:34.8598523Z 2025-05-07T19:50:34.8598527Z 2025-05-07T19:50:34.8598635Z ================================================================================ 2025-05-07T19:50:34.8599034Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:34.8599376Z 2025-05-07T19:50:34.8599567Z CPU_SRCS: 2025-05-07T19:50:34.8599800Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8600105Z 2025-05-07T19:50:34.8600280Z GPU_SRCS: 2025-05-07T19:50:34.8600522Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.8600870Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.8601197Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.8601750Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8602151Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8602558Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8602935Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.8603335Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.8603880Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.8604302Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8604750Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8605253Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 
2025-05-07T19:50:34.8605696Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8606115Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8606557Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8606977Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8607427Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8607861Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8608268Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8608698Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8609109Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8609555Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8609996Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8610456Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8610851Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.8611288Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8611773Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8612216Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.8612757Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8613173Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8613605Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8614031Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8614479Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8614909Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8615384Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8615873Z 
gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8616410Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8616847Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.8617276Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8617779Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8618229Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.8618679Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8619129Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8619545Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8619992Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8620424Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8620874Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8621303Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8621795Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8622283Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8622710Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8623122Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8623446Z 2025-05-07T19:50:34.8623675Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8623825Z 2025-05-07T19:50:34.8623911Z 2025-05-07T19:50:34.8624137Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8624283Z 2025-05-07T19:50:34.8624373Z 2025-05-07T19:50:34.8624797Z OTHER_SRCS: 2025-05-07T19:50:34.8624925Z 2025-05-07T19:50:34.8625034Z 2025-05-07T19:50:34.8625228Z CC_FLAGS: 2025-05-07T19:50:34.8625337Z 2025-05-07T19:50:34.8625416Z 2025-05-07T19:50:34.8625589Z NVCC_FLAGS: 2025-05-07T19:50:34.8625802Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8626066Z 
-D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8626337Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8626618Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8626870Z 2025-05-07T19:50:34.8627041Z HIPCC_FLAGS: 2025-05-07T19:50:34.8627173Z 2025-05-07T19:50:34.8627248Z 2025-05-07T19:50:34.8627422Z INCLUDE_DIRS: 2025-05-07T19:50:34.8627671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8627986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8628259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8628583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8629051Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8629829Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8630463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8630871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8631299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8631751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8632262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8632696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8633233Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8633821Z 2025-05-07T19:50:34.8634011Z Selected Source Files: 2025-05-07T19:50:34.8634282Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8634663Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8635065Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8635455Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8635851Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 
2025-05-07T19:50:34.8636295Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8636681Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8637078Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8637487Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8637908Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8638328Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8638722Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8639073Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8639422Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.8639760Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.8640094Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.8640447Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8640843Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8641240Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8641601Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.8641957Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.8642302Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.8642668Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8643057Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8643453Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8643840Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8644225Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8644611Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8645004Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8645401Z 
gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8645768Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.8646170Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8646595Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8647010Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.8647404Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8647787Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8648172Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8648573Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8648956Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8649338Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8649759Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8650134Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.8650546Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8651002Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8651422Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.8651835Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8652234Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8652644Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8653122Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8653516Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8653933Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8654371Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8654812Z 
gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8655138Z 2025-05-07T19:50:34.8655330Z HIPified Source Files: 2025-05-07T19:50:34.8655478Z 2025-05-07T19:50:34.8655553Z 2025-05-07T19:50:34.8655805Z Library Dependencies: 2025-05-07T19:50:34.8656025Z torch 2025-05-07T19:50:34.8656214Z torch_library 2025-05-07T19:50:34.8656725Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8657392Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8658075Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8658843Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8659566Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8660021Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8660374Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8660763Z 2025-05-07T19:50:34.8660939Z Output Library: 2025-05-07T19:50:34.8661173Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:34.8661420Z 2025-05-07T19:50:34.8661616Z Destination Directory: 2025-05-07T19:50:34.8661838Z fbgemm_gpu 2025-05-07T19:50:34.8662063Z ================================================================================ 2025-05-07T19:50:34.8662289Z 2025-05-07T19:50:34.8662293Z 2025-05-07T19:50:34.8662296Z 2025-05-07T19:50:34.8662409Z ================================================================================ 2025-05-07T19:50:34.8662835Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:34.8663235Z 2025-05-07T19:50:34.8663416Z CPU_SRCS: 2025-05-07T19:50:34.8663652Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8664015Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8664379Z 
gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8664696Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8665031Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8665370Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8665766Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8666192Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8666565Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:34.8666974Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8667386Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8667795Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8668274Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8668840Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8669381Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8669876Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8670471Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8670864Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8671315Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8671737Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8672135Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8672519Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8672930Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8673564Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8674074Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8674533Z 
gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8675000Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8675518Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8675992Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8676641Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8677287Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8677908Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8678491Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8678884Z 2025-05-07T19:50:34.8679070Z GPU_SRCS: 2025-05-07T19:50:34.8679342Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8679798Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8680237Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8680626Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8681021Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8681429Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8681905Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8682417Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8682880Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8683367Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8683891Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8684379Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 
2025-05-07T19:50:34.8684956Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8685615Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8686247Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8686831Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8687344Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8687703Z 2025-05-07T19:50:34.8687888Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8688022Z 2025-05-07T19:50:34.8688093Z 2025-05-07T19:50:34.8688281Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8688414Z 2025-05-07T19:50:34.8688485Z 2025-05-07T19:50:34.8688661Z OTHER_SRCS: 2025-05-07T19:50:34.8688776Z 2025-05-07T19:50:34.8688847Z 2025-05-07T19:50:34.8689023Z CC_FLAGS: 2025-05-07T19:50:34.8689131Z 2025-05-07T19:50:34.8689213Z 2025-05-07T19:50:34.8689384Z NVCC_FLAGS: 2025-05-07T19:50:34.8689597Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8689858Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8690133Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8690419Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8690657Z 2025-05-07T19:50:34.8690829Z HIPCC_FLAGS: 2025-05-07T19:50:34.8690965Z 2025-05-07T19:50:34.8691040Z 2025-05-07T19:50:34.8691206Z INCLUDE_DIRS: 2025-05-07T19:50:34.8691436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8691733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8692006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8692308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8692773Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8693542Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8694248Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8694648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8695053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8695527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8696035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8696632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8697176Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8697648Z 2025-05-07T19:50:34.8697851Z Selected Source Files: 2025-05-07T19:50:34.8698114Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8698479Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8698833Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8699156Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8699479Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8699803Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8700203Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8700618Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8701000Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:34.8701393Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8701815Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8702212Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8702702Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8703272Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8703813Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8704313Z 
gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8704726Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8705136Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8705584Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8706031Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8706433Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8706832Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8707248Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8707711Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8708250Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8708701Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8709191Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8709713Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8710197Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8710785Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8711419Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8712067Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8712632Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8713124Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8713581Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8714013Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8714485Z 
gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8714876Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8715295Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8715762Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8716289Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8716757Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8718318Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8718848Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8719337Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8719926Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8720571Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8721221Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8721812Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8722436Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8722792Z 2025-05-07T19:50:34.8722976Z HIPified Source Files: 2025-05-07T19:50:34.8723132Z 2025-05-07T19:50:34.8723204Z 2025-05-07T19:50:34.8723385Z Library Dependencies: 2025-05-07T19:50:34.8723607Z torch 2025-05-07T19:50:34.8723783Z torch_library 2025-05-07T19:50:34.8724205Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8724856Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8725513Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:34.8726277Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8726971Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8727414Z fbgemm 2025-05-07T19:50:34.8727594Z fbgemm_gpu_config 2025-05-07T19:50:34.8727814Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8728199Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8728418Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8728662Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8729030Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8729412Z 2025-05-07T19:50:34.8729590Z Output Library: 2025-05-07T19:50:34.8729826Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:34.8730080Z 2025-05-07T19:50:34.8730270Z Destination Directory: 2025-05-07T19:50:34.8730487Z fbgemm_gpu 2025-05-07T19:50:34.8730715Z ================================================================================ 2025-05-07T19:50:34.8730930Z 2025-05-07T19:50:34.8731162Z 2025-05-07T19:50:34.8731170Z 2025-05-07T19:50:34.8731293Z ================================================================================ 2025-05-07T19:50:34.8731685Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:34.8732048Z 2025-05-07T19:50:34.8732217Z CPU_SRCS: 2025-05-07T19:50:34.8732703Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:34.8733128Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:34.8733505Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:34.8733871Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8734224Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:34.8734548Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:34.8734866Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:34.8735203Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:34.8735581Z 
gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:34.8736010Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:34.8736552Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:34.8736968Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8737402Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:34.8737790Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8738294Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8738842Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8739469Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8750741Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:34.8751271Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8751650Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8752004Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:34.8752303Z 2025-05-07T19:50:34.8752484Z GPU_SRCS: 2025-05-07T19:50:34.8752719Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:34.8753130Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8753557Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8753980Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8754392Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:34.8754844Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:34.8755327Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:34.8755817Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8756331Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:34.8756864Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8757366Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:34.8757832Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8758328Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8758770Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8759181Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8759620Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8760060Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8760546Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8761047Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8761505Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8761930Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8762382Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8762846Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8763306Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8763804Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8764406Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8764924Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8765462Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8765969Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8766446Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 
2025-05-07T19:50:34.8766939Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8767372Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8767935Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8768333Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8768726Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8769164Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8769626Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8770030Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8770826Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8771358Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8771768Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8772152Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8772565Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8772993Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8773438Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8773915Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8774339Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8774749Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8775173Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8775570Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8775953Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8776455Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8776880Z 
gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8777321Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8777804Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8778228Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8778635Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8779058Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8779478Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8779891Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8780330Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8780787Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8781262Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8781773Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8782227Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8782662Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8783123Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8783601Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8784110Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8784644Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8785181Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8785743Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8786342Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8786896Z 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8787406Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8787956Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8788472Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8789195Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8789682Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8790187Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8790707Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8791252Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8791813Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8792289Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8792795Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8793218Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:34.8793582Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8793974Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8794363Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8794796Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8795236Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8795644Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:34.8796015Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8796421Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:34.8796886Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:34.8797405Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8797950Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8798501Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8799078Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8799671Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8800231Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:34.8800756Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8801318Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8801714Z 2025-05-07T19:50:34.8801884Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8802007Z 2025-05-07T19:50:34.8802082Z 2025-05-07T19:50:34.8802242Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8802547Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:34.8802973Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:34.8803385Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:34.8803710Z 2025-05-07T19:50:34.8803872Z OTHER_SRCS: 2025-05-07T19:50:34.8803975Z 2025-05-07T19:50:34.8804040Z 2025-05-07T19:50:34.8804198Z CC_FLAGS: 2025-05-07T19:50:34.8804298Z 2025-05-07T19:50:34.8804366Z 2025-05-07T19:50:34.8804526Z NVCC_FLAGS: 2025-05-07T19:50:34.8804717Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8804951Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8805198Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8805445Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8805666Z 
2025-05-07T19:50:34.8805821Z HIPCC_FLAGS: 2025-05-07T19:50:34.8805930Z 2025-05-07T19:50:34.8805993Z 2025-05-07T19:50:34.8806149Z INCLUDE_DIRS: 2025-05-07T19:50:34.8806352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8806621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8806870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8807136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8807571Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8808343Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8808911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8809275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8809645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8810065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8810584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8810984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8811256Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8811329Z 2025-05-07T19:50:34.8811411Z Selected Source Files: 2025-05-07T19:50:34.8811593Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:34.8811709Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:34.8811821Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:34.8811957Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8812063Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:34.8812173Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:34.8812273Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:34.8812385Z 
gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:34.8812540Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:34.8812682Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:34.8812782Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:34.8812952Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8813076Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:34.8813229Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8813426Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8813646Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8813833Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8813993Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:34.8814110Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8814239Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8814342Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:34.8814464Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:34.8814620Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8814770Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8814911Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8815066Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:34.8815238Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:34.8815407Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:34.8815592Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8815789Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8815986Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8816164Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:34.8816429Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8816609Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8816916Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8817091Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8817326Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8817498Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8817704Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8817898Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8818045Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8818226Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8818470Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8818639Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8818833Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8819032Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8819228Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8819454Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8819688Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8819865Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8820064Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8820277Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8820405Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8820556Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8820706Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8820867Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8821046Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8821225Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8821375Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8821528Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8821685Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8821825Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8821976Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8822128Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8822284Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8822475Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8822657Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8822794Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8822956Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8823115Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8823247Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8823407Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8823753Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 
2025-05-07T19:50:34.8823907Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8824086Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8824275Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8824417Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8824571Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8824737Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8824880Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8825042Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8825218Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8825441Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8825635Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8825838Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8825987Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8826160Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8826335Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8826580Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8826791Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8827004Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8827221Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8827463Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8827704Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 
2025-05-07T19:50:34.8827909Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8828124Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8828341Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8828531Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8828752Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8828964Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8829276Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8829503Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8829731Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8829911Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8830115Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8830320Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8830439Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:34.8830588Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8830735Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8830884Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8831053Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8831232Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8831358Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:34.8831505Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8831659Z 
gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8831860Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:34.8832077Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8832309Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8832535Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8832778Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8833030Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8833232Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:34.8833457Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8833734Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8833806Z 2025-05-07T19:50:34.8833888Z HIPified Source Files: 2025-05-07T19:50:34.8833893Z 2025-05-07T19:50:34.8833959Z 2025-05-07T19:50:34.8834047Z Library Dependencies: 2025-05-07T19:50:34.8834112Z torch 2025-05-07T19:50:34.8834183Z torch_library 2025-05-07T19:50:34.8834459Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8834743Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8835036Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8835346Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8835592Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8835660Z fbgemm 2025-05-07T19:50:34.8835735Z 
fbgemm_gpu_config 2025-05-07T19:50:34.8835815Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8835892Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8835968Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8836063Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8836262Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8836328Z 2025-05-07T19:50:34.8836399Z Output Library: 2025-05-07T19:50:34.8836498Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8836563Z 2025-05-07T19:50:34.8836640Z Destination Directory: 2025-05-07T19:50:34.8836710Z fbgemm_gpu 2025-05-07T19:50:34.8836819Z ================================================================================ 2025-05-07T19:50:34.8836824Z 2025-05-07T19:50:34.8836829Z 2025-05-07T19:50:34.8836833Z 2025-05-07T19:50:34.8836928Z ================================================================================ 2025-05-07T19:50:34.8837109Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:34.8837183Z 2025-05-07T19:50:34.8837250Z CPU_SRCS: 2025-05-07T19:50:34.8837254Z 2025-05-07T19:50:34.8837316Z 2025-05-07T19:50:34.8837396Z GPU_SRCS: 2025-05-07T19:50:34.8837565Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:34.8837756Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8837960Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8838137Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:34.8838338Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8838535Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8838724Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8838926Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8839132Z 
gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8839325Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8839534Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8839745Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8839815Z 2025-05-07T19:50:34.8839890Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8839894Z 2025-05-07T19:50:34.8839958Z 2025-05-07T19:50:34.8840031Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8840046Z 2025-05-07T19:50:34.8840110Z 2025-05-07T19:50:34.8840179Z OTHER_SRCS: 2025-05-07T19:50:34.8840183Z 2025-05-07T19:50:34.8840249Z 2025-05-07T19:50:34.8840326Z CC_FLAGS: 2025-05-07T19:50:34.8840330Z 2025-05-07T19:50:34.8840398Z 2025-05-07T19:50:34.8840470Z NVCC_FLAGS: 2025-05-07T19:50:34.8840560Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8840644Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8840788Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8840877Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8840950Z 2025-05-07T19:50:34.8841021Z HIPCC_FLAGS: 2025-05-07T19:50:34.8841026Z 2025-05-07T19:50:34.8841091Z 2025-05-07T19:50:34.8841170Z INCLUDE_DIRS: 2025-05-07T19:50:34.8841266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8841351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8841442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8841541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8841842Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8842191Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8842330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8842472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8842612Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8842804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8842981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8843107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8843381Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8843455Z 2025-05-07T19:50:34.8843536Z Selected Source Files: 2025-05-07T19:50:34.8843711Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:34.8843914Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8844113Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8844290Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:34.8844494Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8844694Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8844877Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8845077Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8845289Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8845482Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8845694Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8845914Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8845984Z 2025-05-07T19:50:34.8846066Z HIPified Source Files: 2025-05-07T19:50:34.8846070Z 2025-05-07T19:50:34.8846138Z 2025-05-07T19:50:34.8846218Z Library Dependencies: 2025-05-07T19:50:34.8846286Z torch 
2025-05-07T19:50:34.8846355Z torch_library 2025-05-07T19:50:34.8846640Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8846862Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8847154Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8847472Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8847710Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8847803Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8847998Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8848065Z 2025-05-07T19:50:34.8848138Z Output Library: 2025-05-07T19:50:34.8848229Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:34.8848302Z 2025-05-07T19:50:34.8848382Z Destination Directory: 2025-05-07T19:50:34.8848454Z fbgemm_gpu 2025-05-07T19:50:34.8848565Z ================================================================================ 2025-05-07T19:50:34.8848627Z 2025-05-07T19:50:34.8848631Z 2025-05-07T19:50:34.8848635Z 2025-05-07T19:50:34.8848738Z ================================================================================ 2025-05-07T19:50:34.8848920Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:34.8848994Z 2025-05-07T19:50:34.8849065Z CPU_SRCS: 2025-05-07T19:50:34.8849069Z 2025-05-07T19:50:34.8849133Z 2025-05-07T19:50:34.8849202Z GPU_SRCS: 2025-05-07T19:50:34.8849462Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8849633Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8849812Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8849991Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8850210Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8850436Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8850577Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8850721Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8850858Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8851003Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8851147Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8851292Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8851464Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8851671Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8851863Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8852027Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8852221Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8852409Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8852582Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8852781Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8852985Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8853154Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8853347Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8853546Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:50:34.8853759Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8853994Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8854236Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8854454Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8854695Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8854942Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8855072Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8855230Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8855381Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8855523Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8855679Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8855839Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8855984Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8856196Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8856443Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8856593Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8856938Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8857120Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8857273Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8857501Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8857671Z 
gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8857821Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8858000Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8858173Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8858252Z 2025-05-07T19:50:34.8858339Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8858344Z 2025-05-07T19:50:34.8858412Z 2025-05-07T19:50:34.8858493Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8858497Z 2025-05-07T19:50:34.8858565Z 2025-05-07T19:50:34.8858645Z OTHER_SRCS: 2025-05-07T19:50:34.8858650Z 2025-05-07T19:50:34.8858717Z 2025-05-07T19:50:34.8858789Z CC_FLAGS: 2025-05-07T19:50:34.8858793Z 2025-05-07T19:50:34.8858867Z 2025-05-07T19:50:34.8858940Z NVCC_FLAGS: 2025-05-07T19:50:34.8859035Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8859138Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8859236Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8859325Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8859393Z 2025-05-07T19:50:34.8859477Z HIPCC_FLAGS: 2025-05-07T19:50:34.8859482Z 2025-05-07T19:50:34.8859548Z 2025-05-07T19:50:34.8859624Z INCLUDE_DIRS: 2025-05-07T19:50:34.8859734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8859828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8859927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8860027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8860308Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8860689Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8860825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8860991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8861144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
2025-05-07T19:50:34.8861337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8861537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8861683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8861976Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8862058Z 2025-05-07T19:50:34.8862143Z Selected Source Files: 2025-05-07T19:50:34.8862334Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8862516Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8862720Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8862908Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8863148Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8863395Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8863539Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8863692Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8863846Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8864005Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8864203Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8864355Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8864547Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8864755Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8864967Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8865205Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 
2025-05-07T19:50:34.8865405Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8865605Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8865802Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8866014Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8866235Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8866417Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8866634Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8866841Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8867070Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8867330Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8867581Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8867819Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8868088Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8868345Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8868490Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8868658Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8868823Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8868968Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8869138Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8869317Z 
gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8869466Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8869634Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8869815Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8869968Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8870308Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8870500Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8870642Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8870807Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8870976Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8871132Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8871304Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8871485Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8871559Z 2025-05-07T19:50:34.8871643Z HIPified Source Files: 2025-05-07T19:50:34.8871647Z 2025-05-07T19:50:34.8871718Z 2025-05-07T19:50:34.8871810Z Library Dependencies: 2025-05-07T19:50:34.8871885Z torch 2025-05-07T19:50:34.8871964Z torch_library 2025-05-07T19:50:34.8872263Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8872618Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8872934Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8873275Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8873543Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:34.8873644Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8873919Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8873995Z 2025-05-07T19:50:34.8874077Z Output Library: 2025-05-07T19:50:34.8874180Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:34.8874251Z 2025-05-07T19:50:34.8874350Z Destination Directory: 2025-05-07T19:50:34.8874427Z fbgemm_gpu 2025-05-07T19:50:34.8874534Z ================================================================================ 2025-05-07T19:50:34.8874543Z 2025-05-07T19:50:34.8874547Z 2025-05-07T19:50:34.8874551Z 2025-05-07T19:50:34.8874673Z ================================================================================ 2025-05-07T19:50:34.8874879Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:34.8874945Z 2025-05-07T19:50:34.8875034Z CPU_SRCS: 2025-05-07T19:50:34.8875039Z 2025-05-07T19:50:34.8875110Z 2025-05-07T19:50:34.8875186Z GPU_SRCS: 2025-05-07T19:50:34.8875335Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:34.8875487Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:34.8875643Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8875807Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8875985Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8876152Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:34.8876337Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8876540Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8876694Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:34.8876841Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:34.8877006Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8877187Z 
gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8877297Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:34.8877369Z 2025-05-07T19:50:34.8877467Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8877472Z 2025-05-07T19:50:34.8877547Z 2025-05-07T19:50:34.8877631Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8877636Z 2025-05-07T19:50:34.8877702Z 2025-05-07T19:50:34.8877788Z OTHER_SRCS: 2025-05-07T19:50:34.8877792Z 2025-05-07T19:50:34.8877865Z 2025-05-07T19:50:34.8877940Z CC_FLAGS: 2025-05-07T19:50:34.8877944Z 2025-05-07T19:50:34.8878028Z 2025-05-07T19:50:34.8878107Z NVCC_FLAGS: 2025-05-07T19:50:34.8878204Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8878303Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8878406Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8878499Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8878571Z 2025-05-07T19:50:34.8878665Z HIPCC_FLAGS: 2025-05-07T19:50:34.8878670Z 2025-05-07T19:50:34.8878742Z 2025-05-07T19:50:34.8878821Z INCLUDE_DIRS: 2025-05-07T19:50:34.8878935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8879026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8879133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8879231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8879515Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8879892Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8880028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8880276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8880427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8880625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8880825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
2025-05-07T19:50:34.8880962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8881264Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8881343Z 2025-05-07T19:50:34.8881499Z Selected Source Files: 2025-05-07T19:50:34.8881647Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:34.8881930Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:34.8882085Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:34.8882187Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:34.8882311Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:34.8882463Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8882620Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8882772Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8882942Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8883139Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8883271Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:34.8883430Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8883593Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8883661Z 2025-05-07T19:50:34.8883747Z HIPified Source Files: 2025-05-07T19:50:34.8883751Z 2025-05-07T19:50:34.8883817Z 2025-05-07T19:50:34.8883913Z Library Dependencies: 2025-05-07T19:50:34.8883982Z torch 2025-05-07T19:50:34.8884052Z torch_library 2025-05-07T19:50:34.8884349Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8884575Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8884866Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8885195Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8885433Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8885530Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8885719Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8885794Z 2025-05-07T19:50:34.8885868Z Output Library: 2025-05-07T19:50:34.8885965Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:34.8886048Z 2025-05-07T19:50:34.8886131Z Destination Directory: 2025-05-07T19:50:34.8886201Z fbgemm_gpu 2025-05-07T19:50:34.8886305Z ================================================================================ 2025-05-07T19:50:34.8886317Z 2025-05-07T19:50:34.8886320Z 2025-05-07T19:50:34.8886324Z 2025-05-07T19:50:34.8886425Z ================================================================================ 2025-05-07T19:50:34.8886623Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:34.8886687Z 2025-05-07T19:50:34.8886767Z CPU_SRCS: 2025-05-07T19:50:34.8886771Z 2025-05-07T19:50:34.8886838Z 2025-05-07T19:50:34.8886909Z GPU_SRCS: 2025-05-07T19:50:34.8887022Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:34.8887140Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:34.8887233Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:34.8887336Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:34.8887426Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:34.8887524Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:34.8887715Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:34.8887865Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:34.8887960Z gen_embedding_backward_split_none.cpp 
2025-05-07T19:50:34.8888120Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8888236Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:34.8888372Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:34.8888555Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8888801Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8888979Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8889122Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:34.8889239Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:34.8889385Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8889531Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8889698Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8889877Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8890011Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8890151Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8890280Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8890416Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8890539Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8890668Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8890807Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8890948Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8891124Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8891312Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8891490Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8891674Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8891797Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:34.8891934Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:34.8892138Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:34.8892354Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:34.8892430Z 2025-05-07T19:50:34.8892503Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8892507Z 2025-05-07T19:50:34.8892572Z 2025-05-07T19:50:34.8892652Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8892661Z 2025-05-07T19:50:34.8892725Z 2025-05-07T19:50:34.8892794Z OTHER_SRCS: 2025-05-07T19:50:34.8892798Z 2025-05-07T19:50:34.8892862Z 2025-05-07T19:50:34.8892944Z CC_FLAGS: 2025-05-07T19:50:34.8892948Z 2025-05-07T19:50:34.8893011Z 2025-05-07T19:50:34.8893079Z NVCC_FLAGS: 2025-05-07T19:50:34.8893174Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8893260Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8893351Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8893438Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8893509Z 2025-05-07T19:50:34.8893581Z HIPCC_FLAGS: 2025-05-07T19:50:34.8893585Z 2025-05-07T19:50:34.8893647Z 2025-05-07T19:50:34.8893724Z INCLUDE_DIRS: 2025-05-07T19:50:34.8893819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8893907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8893997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8894098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8894347Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8894697Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8894889Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8895030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8895168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8895355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8895534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8895660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8895990Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8896065Z 2025-05-07T19:50:34.8896144Z Selected Source Files: 2025-05-07T19:50:34.8896246Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:34.8896456Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:34.8896550Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:34.8896643Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:34.8896914Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:34.8897029Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:34.8897171Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:34.8897311Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:34.8897421Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:34.8897591Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8897701Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:34.8897855Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:34.8898052Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8898264Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8898449Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8898608Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 
2025-05-07T19:50:34.8898732Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:34.8898873Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8899028Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8899202Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8899380Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8899514Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8899652Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8899785Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8899926Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8900062Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8900201Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8900344Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8900510Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8900701Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8900899Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8901095Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8901291Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8901424Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:34.8901566Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:34.8901791Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:34.8902020Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:34.8902091Z 2025-05-07T19:50:34.8902184Z HIPified Source Files: 2025-05-07T19:50:34.8902188Z 
2025-05-07T19:50:34.8902256Z 2025-05-07T19:50:34.8902340Z Library Dependencies: 2025-05-07T19:50:34.8902477Z torch 2025-05-07T19:50:34.8902562Z torch_library 2025-05-07T19:50:34.8902862Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8903103Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8903426Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8903764Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8904079Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8904169Z fbgemm_gpu_config 2025-05-07T19:50:34.8904253Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8904459Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8904531Z 2025-05-07T19:50:34.8904616Z Output Library: 2025-05-07T19:50:34.8904728Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:34.8904800Z 2025-05-07T19:50:34.8904896Z Destination Directory: 2025-05-07T19:50:34.8904971Z fbgemm_gpu 2025-05-07T19:50:34.8905080Z ================================================================================ 2025-05-07T19:50:34.8905084Z 2025-05-07T19:50:34.8905089Z 2025-05-07T19:50:34.8905092Z 2025-05-07T19:50:34.8905205Z ================================================================================ 2025-05-07T19:50:34.8905369Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:34.8905440Z 2025-05-07T19:50:34.8905511Z CPU_SRCS: 2025-05-07T19:50:34.8905722Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:34.8905901Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:34.8905969Z 2025-05-07T19:50:34.8906049Z GPU_SRCS: 2025-05-07T19:50:34.8906229Z 
codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:34.8906359Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.8906486Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.8906614Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.8906749Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.8906875Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.8907008Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.8907132Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.8907205Z 2025-05-07T19:50:34.8907297Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8907302Z 2025-05-07T19:50:34.8907373Z 2025-05-07T19:50:34.8907453Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8907457Z 2025-05-07T19:50:34.8907524Z 2025-05-07T19:50:34.8907604Z OTHER_SRCS: 2025-05-07T19:50:34.8907609Z 2025-05-07T19:50:34.8907676Z 2025-05-07T19:50:34.8907749Z CC_FLAGS: 2025-05-07T19:50:34.8907753Z 2025-05-07T19:50:34.8907826Z 2025-05-07T19:50:34.8907901Z NVCC_FLAGS: 2025-05-07T19:50:34.8907993Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8908094Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8908192Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8908283Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8908351Z 2025-05-07T19:50:34.8908434Z HIPCC_FLAGS: 2025-05-07T19:50:34.8908439Z 2025-05-07T19:50:34.8908505Z 2025-05-07T19:50:34.8908580Z INCLUDE_DIRS: 2025-05-07T19:50:34.8908688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8908776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8908871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8909084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8909341Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8909689Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8909813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8909961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8910167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8910349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8910531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8910656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8910928Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8910993Z 2025-05-07T19:50:34.8911080Z Selected Source Files: 2025-05-07T19:50:34.8911314Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:34.8911481Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:34.8911653Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:34.8911772Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.8911881Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.8912009Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.8912131Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.8912247Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.8912363Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.8912485Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.8912550Z 2025-05-07T19:50:34.8912630Z HIPified Source Files: 2025-05-07T19:50:34.8912634Z 2025-05-07T19:50:34.8912703Z 2025-05-07T19:50:34.8912780Z Library Dependencies: 2025-05-07T19:50:34.8912849Z torch 2025-05-07T19:50:34.8912916Z torch_library 2025-05-07T19:50:34.8913194Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8913416Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8913708Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8914026Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8914267Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8914354Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8914438Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8914624Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8914687Z 2025-05-07T19:50:34.8914759Z Output Library: 2025-05-07T19:50:34.8914856Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:34.8914919Z 2025-05-07T19:50:34.8914999Z Destination Directory: 2025-05-07T19:50:34.8915072Z fbgemm_gpu 2025-05-07T19:50:34.8915173Z ================================================================================ 2025-05-07T19:50:34.8915177Z 2025-05-07T19:50:34.8915181Z 2025-05-07T19:50:34.8915185Z 2025-05-07T19:50:34.8915285Z ================================================================================ 2025-05-07T19:50:34.8915479Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:34.8915546Z 2025-05-07T19:50:34.8915618Z CPU_SRCS: 2025-05-07T19:50:34.8915780Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:34.8915858Z 2025-05-07T19:50:34.8915927Z GPU_SRCS: 2025-05-07T19:50:34.8916084Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:34.8916235Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:34.8916302Z 2025-05-07T19:50:34.8916379Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8916383Z 2025-05-07T19:50:34.8916450Z 2025-05-07T19:50:34.8916540Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:50:34.8916544Z 2025-05-07T19:50:34.8916615Z 2025-05-07T19:50:34.8916689Z OTHER_SRCS: 2025-05-07T19:50:34.8916693Z 2025-05-07T19:50:34.8916772Z 2025-05-07T19:50:34.8916843Z CC_FLAGS: 2025-05-07T19:50:34.8916847Z 2025-05-07T19:50:34.8916916Z 2025-05-07T19:50:34.8917002Z NVCC_FLAGS: 2025-05-07T19:50:34.8917095Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8917237Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8917333Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8917434Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8917505Z 2025-05-07T19:50:34.8917578Z HIPCC_FLAGS: 2025-05-07T19:50:34.8917582Z 2025-05-07T19:50:34.8917662Z 2025-05-07T19:50:34.8917740Z INCLUDE_DIRS: 2025-05-07T19:50:34.8917839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8917929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8918037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8918186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8918445Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8918805Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8918938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8919085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8919232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8919430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8919612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8919744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8920034Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8920107Z 2025-05-07T19:50:34.8920190Z Selected Source Files: 2025-05-07T19:50:34.8920364Z 
src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:34.8920522Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:34.8920659Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:34.8920728Z 2025-05-07T19:50:34.8920829Z HIPified Source Files: 2025-05-07T19:50:34.8920833Z 2025-05-07T19:50:34.8920903Z 2025-05-07T19:50:34.8920989Z Library Dependencies: 2025-05-07T19:50:34.8921075Z torch 2025-05-07T19:50:34.8921153Z torch_library 2025-05-07T19:50:34.8921434Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8921661Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8921970Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8922288Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8922537Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8922746Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8922819Z 2025-05-07T19:50:34.8922899Z Output Library: 2025-05-07T19:50:34.8923007Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:34.8923078Z 2025-05-07T19:50:34.8923160Z Destination Directory: 2025-05-07T19:50:34.8923388Z fbgemm_gpu 2025-05-07T19:50:34.8923496Z ================================================================================ 2025-05-07T19:50:34.8923500Z 2025-05-07T19:50:34.8923575Z 2025-05-07T19:50:34.8923579Z 2025-05-07T19:50:34.8923685Z ================================================================================ 2025-05-07T19:50:34.8923800Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:34.8923865Z 2025-05-07T19:50:34.8923948Z CPU_SRCS: 2025-05-07T19:50:34.8924041Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:34.8924136Z 
src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:34.8924315Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8924516Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:34.8924698Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8924893Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:34.8925142Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8925351Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:34.8925485Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:34.8925611Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:34.8925726Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:34.8925836Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:34.8925970Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:34.8926123Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:34.8926224Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:34.8926340Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:34.8926443Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:34.8926535Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:34.8926620Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:34.8926703Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:34.8926812Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:34.8926903Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:34.8926997Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:34.8927098Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:34.8927313Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:34.8927451Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:34.8927654Z 
src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8927867Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:34.8927966Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:34.8928057Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:34.8928157Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:34.8928266Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:34.8928446Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8928538Z src/topology_utils.cpp 2025-05-07T19:50:34.8928604Z 2025-05-07T19:50:34.8928678Z GPU_SRCS: 2025-05-07T19:50:34.8928783Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:34.8928889Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:34.8929082Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:34.8929173Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:34.8929276Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:34.8929452Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:34.8929627Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:34.8929758Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:34.8929879Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:34.8930110Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:34.8930276Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:34.8930451Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:34.8930586Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:34.8930725Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:34.8930867Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:34.8930986Z src/jagged_tensor_ops/jagged_softmax_backward.cu 
2025-05-07T19:50:34.8931106Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:34.8931219Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:34.8931367Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:34.8931505Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:34.8931618Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:34.8931765Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:34.8931883Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:34.8931975Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:34.8932231Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:34.8932405Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:34.8932575Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:34.8932675Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:34.8932787Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:34.8932905Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:34.8933021Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:34.8933189Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:34.8933281Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:34.8933399Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:34.8933501Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:34.8933615Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:34.8933737Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:34.8933850Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:34.8933984Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:34.8934113Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:34.8934244Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:34.8934352Z src/sparse_ops/sparse_group_index.cu 
2025-05-07T19:50:34.8934443Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:34.8934539Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:34.8934640Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:34.8934772Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:34.8934888Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:34.8934985Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:34.8935087Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:34.8935180Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:34.8935286Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:34.8935381Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:34.8935499Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:34.8935600Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:34.8935691Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:34.8935771Z 2025-05-07T19:50:34.8935850Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8935855Z 2025-05-07T19:50:34.8935924Z 2025-05-07T19:50:34.8936014Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8936018Z 2025-05-07T19:50:34.8936083Z 2025-05-07T19:50:34.8936156Z OTHER_SRCS: 2025-05-07T19:50:34.8936161Z 2025-05-07T19:50:34.8936232Z 2025-05-07T19:50:34.8936395Z CC_FLAGS: 2025-05-07T19:50:34.8936404Z 2025-05-07T19:50:34.8936472Z 2025-05-07T19:50:34.8936545Z NVCC_FLAGS: 2025-05-07T19:50:34.8936650Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8936910Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8937009Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8937103Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8937193Z 2025-05-07T19:50:34.8937271Z HIPCC_FLAGS: 2025-05-07T19:50:34.8937280Z 2025-05-07T19:50:34.8937353Z 2025-05-07T19:50:34.8937444Z INCLUDE_DIRS: 2025-05-07T19:50:34.8937549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8937720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8937822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8937932Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8938207Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:34.8938593Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8938758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8938916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8939073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8939281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8939473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8939678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8939977Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8940063Z 2025-05-07T19:50:34.8940152Z Selected Source Files: 2025-05-07T19:50:34.8940252Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:34.8940365Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:34.8940559Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8940821Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:34.8941034Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8941248Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:34.8941452Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8941680Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:34.8941842Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:34.8941970Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:34.8942098Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 
2025-05-07T19:50:34.8942226Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:34.8942371Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:34.8942476Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:34.8942592Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:34.8942722Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:34.8942821Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:34.8942918Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:34.8943013Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:34.8943101Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:34.8943202Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:34.8943299Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:34.8943402Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:34.8943499Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:34.8943726Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:34.8943879Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:34.8944084Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8944310Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:34.8944418Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:34.8944518Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:34.8944613Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:34.8944725Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:34.8944919Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8945003Z src/topology_utils.cpp 2025-05-07T19:50:34.8945113Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:34.8945223Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:34.8945429Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:34.8945524Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:34.8945628Z 
src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:34.8945813Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:34.8945990Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:34.8946113Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:34.8946248Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:34.8946496Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:34.8946671Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:34.8946849Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:34.8946989Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:34.8947132Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:34.8947321Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:34.8947443Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:34.8947564Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:34.8947670Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:34.8947831Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:34.8947978Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:34.8948099Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:34.8948301Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:34.8948429Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:34.8948522Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:34.8948742Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:34.8948926Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:34.8949210Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:34.8949306Z src/quantize_ops/quantize_bfloat16.cu 
2025-05-07T19:50:34.8949409Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:34.8949525Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:34.8949636Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:34.8949730Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:34.8949813Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:34.8949922Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:34.8950013Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:34.8950127Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:34.8950248Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:34.8950350Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:34.8950477Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:34.8950602Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:34.8950728Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:34.8950827Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:34.8950915Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:34.8951006Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:34.8951100Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:34.8951219Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:34.8951328Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:34.8951416Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:34.8951511Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:34.8951603Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:34.8951705Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:34.8951791Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:34.8951904Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:34.8952001Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:34.8952087Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:34.8952163Z 2025-05-07T19:50:34.8952240Z HIPified Source Files: 
2025-05-07T19:50:34.8952244Z 2025-05-07T19:50:34.8952305Z 2025-05-07T19:50:34.8952383Z Library Dependencies: 2025-05-07T19:50:34.8952450Z torch 2025-05-07T19:50:34.8952522Z torch_library 2025-05-07T19:50:34.8952799Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8953026Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8953317Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8953630Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8953878Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8953942Z fbgemm 2025-05-07T19:50:34.8954032Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8954121Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:34.8954260Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:34.8954333Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8954414Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:34.8954501Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8954691Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8954753Z 2025-05-07T19:50:34.8954825Z Output Library: 2025-05-07T19:50:34.8954906Z fbgemm_gpu_py 2025-05-07T19:50:34.8954969Z 2025-05-07T19:50:34.8955049Z Destination Directory: 2025-05-07T19:50:34.8955129Z fbgemm_gpu 2025-05-07T19:50:34.8955280Z ================================================================================ 2025-05-07T19:50:34.8955285Z 2025-05-07T19:50:34.8955372Z -- Configuring done (9.0s) 2025-05-07T19:50:35.0245717Z -- Generating done (0.1s) 2025-05-07T19:50:35.0263157Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:50:35.0415612Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build' 
2025-05-07T19:50:35.0416120Z 2025-05-07T19:50:35.0416701Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:35.1633336Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:35.1644451Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1836527Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:35.1848373Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1873824Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:35.1885647Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1943212Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:35.1955036Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1990007Z 
[5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:35.2001842Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2109243Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:35.2121187Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2132655Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:35.2144487Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2171265Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:35.2183306Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2199932Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:35.2211757Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2503805Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:35.2513887Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2661533Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:35.2673553Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:50:35.2684685Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:35.2695409Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2814623Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:35.2825713Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2877596Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:35.2889350Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3024978Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:35.3035828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3082035Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:35.3092929Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3124978Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:35.3135813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3219366Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:35.3230615Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3271866Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:35.3282967Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3505087Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:35.3516975Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3549548Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:35.3560662Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3862301Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:35.3874167Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:35.3887265Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:35.3899085Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3935760Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:35.3947105Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3957118Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:35.3967918Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3979028Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:35.3989865Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4194461Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:35.4205361Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4347601Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:35.4359767Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4422350Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 
2025-05-07T19:50:35.4433223Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4443820Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:35.4454767Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4527029Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:35.4539071Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4615756Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:35.4626529Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4637049Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:35.4647986Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4726023Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:35.4741533Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4874517Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:35.4885723Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5092023Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:35.5102694Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5445427Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:35.5451662Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:35.5457728Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:35.5463867Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5478477Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:35.5484563Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5577208Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:35.5588303Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5816949Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:35.5827366Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5944507Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:35.5954562Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5965396Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:35.5975668Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.6274034Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:35.6284199Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.6880734Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:35.6891035Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7491349Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:35.7501716Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7623791Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:35.7634239Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7869888Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:35.7876464Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:50:35.8139764Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:35.8146008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.8151854Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:35.8157927Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.8533896Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:35.8546043Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.9118390Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:35.9130324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.9660774Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:35.9672519Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.1997574Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:36.2011628Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.2226362Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:36.2238274Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.2799253Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl 
-fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:36.2817988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.2839670Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:36.2858009Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.3737011Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:36.3749284Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.4611486Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:36.4623230Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.5209809Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:36.5221398Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.6457134Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:36.6473790Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.8351659Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:36.8374910Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.4349149Z [63/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:38.0403895Z [64/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c 
/__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:38.0421326Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.1553990Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:38.1572008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.4997981Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:41.5014299Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.8581778Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:41.8599713Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.6199800Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:43.6298596Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.6314611Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:43.6330410Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.7594494Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:43.7611092Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.8604867Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:43.8621768Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.8871982Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 
2025-05-07T19:50:43.8889375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.5119400Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:44.5137111Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.2071288Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:45.2089864Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.0215484Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:46.0234075Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.0810789Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:47.0827503Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.2707213Z [77/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:47.2722584Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.8462921Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:48.2521384Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:48.2540692Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.6955090Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:50.6973527Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.7115227Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:50.7132891Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.6290348Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:52.6307463Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:54.5224045Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:54.5242039Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:54.9477649Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:54.9494086Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:55.8121215Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:55.8137845Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:58.6760482Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:58.6778367Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:59.0735203Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:59.0751721Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:01.2752941Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:01.2769382Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:05.3771721Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:05.3789581Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:07.0910652Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:07.0930026Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:51:07.7173289Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:07.7191293Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:08.1783351Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:08.1802900Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:10.5617801Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:10.5635670Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:11.4685345Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:11.4702674Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:14.3735767Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:14.3752267Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.0637264Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:16.7717204Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.7733127Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:16.7750156Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:17.2547674Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:17.2561754Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:19.7490843Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:19.7511512Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:20.7152605Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:20.7171326Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:23.5156084Z [101/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:23.5174522Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:31.9603008Z [102/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:31.9622263Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:33.2355584Z [103/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:33.2373000Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:36.7308391Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:36.7329084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7331464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7332529Z ^ 2025-05-07T19:51:36.7332771Z 2025-05-07T19:51:36.7333237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:36.7333814Z 2025-05-07T19:51:36.7335231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7337784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7338897Z ^ 2025-05-07T19:51:36.7339221Z 2025-05-07T19:51:36.7340667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7343091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7344149Z ^ 2025-05-07T19:51:36.7344389Z 2025-05-07T19:51:36.7344778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:36.7345378Z 2025-05-07T19:51:36.7346966Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7349409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7350416Z ^ 2025-05-07T19:51:36.7350730Z 2025-05-07T19:51:36.7352170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7354918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7355978Z ^ 2025-05-07T19:51:36.7356204Z 2025-05-07T19:51:36.7356638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:36.7357221Z 2025-05-07T19:51:36.7358964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7361175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7362265Z ^ 2025-05-07T19:51:36.7362589Z 2025-05-07T19:51:36.7364284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7366607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7367614Z ^ 
2025-05-07T19:51:36.7367848Z 2025-05-07T19:51:36.7368286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:36.7368928Z 2025-05-07T19:51:36.7370679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7372943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7373956Z ^ 2025-05-07T19:51:36.7374260Z 2025-05-07T19:51:36.7375680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7378232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7379342Z ^ 2025-05-07T19:51:36.7379585Z 2025-05-07T19:51:36.7379975Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:36.7380549Z 2025-05-07T19:51:36.7382184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:36.7384780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:36.7386230Z ^ 2025-05-07T19:51:36.7386597Z 2025-05-07T19:51:37.5240500Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:37.5262028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5264628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5265668Z ^ 2025-05-07T19:51:37.5265909Z 2025-05-07T19:51:37.5266329Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5266959Z 2025-05-07T19:51:37.5268545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5271002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5272096Z ^ 2025-05-07T19:51:37.5272430Z 2025-05-07T19:51:37.5273999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5276414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5277508Z ^ 2025-05-07T19:51:37.5277744Z 2025-05-07T19:51:37.5278155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5278776Z 
2025-05-07T19:51:37.5280279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5282605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5283703Z ^ 2025-05-07T19:51:37.5284057Z 2025-05-07T19:51:37.5285620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5288069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5289328Z ^ 2025-05-07T19:51:37.5289938Z 2025-05-07T19:51:37.5290300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5290841Z 2025-05-07T19:51:37.5292323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5294819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5300088Z ^ 2025-05-07T19:51:37.5300440Z 2025-05-07T19:51:37.5301933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5304434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:37.5305527Z ^ 2025-05-07T19:51:37.5305762Z 2025-05-07T19:51:37.5306177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5306811Z 2025-05-07T19:51:37.5308323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5310668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5311742Z ^ 2025-05-07T19:51:37.5312087Z 2025-05-07T19:51:37.5313604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5316043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5317094Z ^ 2025-05-07T19:51:37.5317325Z 2025-05-07T19:51:37.5317762Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5318376Z 2025-05-07T19:51:37.5319879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5322450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5323504Z ^ 2025-05-07T19:51:37.5323841Z 2025-05-07T19:51:37.5427252Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:37.5448116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5450523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5451651Z ^ 2025-05-07T19:51:37.5451899Z 2025-05-07T19:51:37.5452299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5452936Z 2025-05-07T19:51:37.5454436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5456984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5458019Z ^ 2025-05-07T19:51:37.5458350Z 2025-05-07T19:51:37.5459875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5462352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5463430Z ^ 2025-05-07T19:51:37.5463682Z 2025-05-07T19:51:37.5464073Z Remark: 
The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5464694Z 2025-05-07T19:51:37.5466264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5468671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5469643Z ^ 2025-05-07T19:51:37.5469966Z 2025-05-07T19:51:37.5471905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5474630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5475686Z ^ 2025-05-07T19:51:37.5475899Z 2025-05-07T19:51:37.5476308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5476919Z 2025-05-07T19:51:37.5478532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5480972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5482080Z ^ 2025-05-07T19:51:37.5482442Z 2025-05-07T19:51:37.5484003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5486522Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5487623Z ^ 2025-05-07T19:51:37.5487882Z 2025-05-07T19:51:37.5488300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5488904Z 2025-05-07T19:51:37.5490467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5492996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5494035Z ^ 2025-05-07T19:51:37.5494391Z 2025-05-07T19:51:37.5496125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5498682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5499691Z ^ 2025-05-07T19:51:37.5499929Z 2025-05-07T19:51:37.5500363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.5501020Z 2025-05-07T19:51:37.5502526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.5504941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.5506010Z ^ 2025-05-07T19:51:37.5506347Z 2025-05-07T19:51:38.3585782Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:38.3608777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3611336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3612394Z ^ 2025-05-07T19:51:38.3612631Z 2025-05-07T19:51:38.3613029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3613587Z 2025-05-07T19:51:38.3614990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3617433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3618461Z ^ 2025-05-07T19:51:38.3618802Z 2025-05-07T19:51:38.3620173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3622398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3623429Z ^ 2025-05-07T19:51:38.3623686Z 2025-05-07T19:51:38.3624068Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:51:38.3624625Z 2025-05-07T19:51:38.3626051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3628271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3629375Z ^ 2025-05-07T19:51:38.3629713Z 2025-05-07T19:51:38.3631126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3633699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3634689Z ^ 2025-05-07T19:51:38.3634934Z 2025-05-07T19:51:38.3635353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3635924Z 2025-05-07T19:51:38.3637564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3639935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3640992Z ^ 2025-05-07T19:51:38.3641378Z 2025-05-07T19:51:38.3642752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3644967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3645946Z ^ 2025-05-07T19:51:38.3646211Z 2025-05-07T19:51:38.3646605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3647194Z 2025-05-07T19:51:38.3648613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3651028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3652073Z ^ 2025-05-07T19:51:38.3652376Z 2025-05-07T19:51:38.3653718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3655948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3657153Z ^ 2025-05-07T19:51:38.3657380Z 2025-05-07T19:51:38.3657771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3658403Z 2025-05-07T19:51:38.3659761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3662087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3663081Z ^ 2025-05-07T19:51:38.3663400Z 2025-05-07T19:51:38.6020269Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:38.6040006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6042339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6043394Z ^ 2025-05-07T19:51:38.6043634Z 2025-05-07T19:51:38.6044008Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.6044615Z 2025-05-07T19:51:38.6046086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6048441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6049492Z ^ 2025-05-07T19:51:38.6049861Z 2025-05-07T19:51:38.6051355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6053615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6054677Z ^ 2025-05-07T19:51:38.6054921Z 2025-05-07T19:51:38.6055302Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T19:51:38.6055910Z 2025-05-07T19:51:38.6057520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6059851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6061236Z ^ 2025-05-07T19:51:38.6061595Z 2025-05-07T19:51:38.6062942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6065242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6066401Z ^ 2025-05-07T19:51:38.6066665Z 2025-05-07T19:51:38.6067320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.6067955Z 2025-05-07T19:51:38.6069346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6071942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6073009Z ^ 2025-05-07T19:51:38.6073355Z 2025-05-07T19:51:38.6074786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6077113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6078191Z ^ 2025-05-07T19:51:38.6078435Z 2025-05-07T19:51:38.6078858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.6079438Z 2025-05-07T19:51:38.6080859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6083168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6084232Z ^ 2025-05-07T19:51:38.6084585Z 2025-05-07T19:51:38.6086003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6088311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6089351Z ^ 2025-05-07T19:51:38.6089625Z 2025-05-07T19:51:38.6090024Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.6090637Z 2025-05-07T19:51:38.6092049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.6094340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.6095490Z ^ 2025-05-07T19:51:38.6095858Z 2025-05-07T19:51:38.7768104Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:38.7787647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7789986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7790964Z ^ 2025-05-07T19:51:38.7791183Z 2025-05-07T19:51:38.7791600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.7792199Z 2025-05-07T19:51:38.7793590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7795841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7796880Z ^ 2025-05-07T19:51:38.7797178Z 2025-05-07T19:51:38.7798583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7800849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7801852Z ^ 2025-05-07T19:51:38.7802084Z 2025-05-07T19:51:38.7802455Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:51:38.7803018Z 2025-05-07T19:51:38.7804456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7806980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7808039Z ^ 2025-05-07T19:51:38.7808353Z 2025-05-07T19:51:38.7809763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7812188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7813207Z ^ 2025-05-07T19:51:38.7813429Z 2025-05-07T19:51:38.7813831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.7814384Z 2025-05-07T19:51:38.7815793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7818251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7819218Z ^ 2025-05-07T19:51:38.7819560Z 2025-05-07T19:51:38.7820922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7823116Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7824068Z ^ 2025-05-07T19:51:38.7824301Z 2025-05-07T19:51:38.7824633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.7825136Z 2025-05-07T19:51:38.7826445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7828542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7829530Z ^ 2025-05-07T19:51:38.7829814Z 2025-05-07T19:51:38.7831117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7833162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7834097Z ^ 2025-05-07T19:51:38.7834313Z 2025-05-07T19:51:38.7834676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.7835233Z 2025-05-07T19:51:38.7836547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.7838685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.7839614Z ^ 2025-05-07T19:51:38.7839939Z 2025-05-07T19:51:39.0719197Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:39.0740518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0743092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0744313Z ^ 2025-05-07T19:51:39.0744572Z 2025-05-07T19:51:39.0744971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.0745551Z 2025-05-07T19:51:39.0746915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0749301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0750294Z ^ 2025-05-07T19:51:39.0750646Z 2025-05-07T19:51:39.0752084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0754489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0755532Z ^ 2025-05-07T19:51:39.0755815Z 2025-05-07T19:51:39.0756218Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T19:51:39.0756849Z 2025-05-07T19:51:39.0758409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0761244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0762396Z ^ 2025-05-07T19:51:39.0762742Z 2025-05-07T19:51:39.0764444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0766880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0768004Z ^ 2025-05-07T19:51:39.0768244Z 2025-05-07T19:51:39.0768650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.0769293Z 2025-05-07T19:51:39.0771018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0773275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0774271Z ^ 2025-05-07T19:51:39.0774611Z 2025-05-07T19:51:39.0776079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0778404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0779343Z ^ 2025-05-07T19:51:39.0779595Z 2025-05-07T19:51:39.0779987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.0780561Z 2025-05-07T19:51:39.0782064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0784533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0785666Z ^ 2025-05-07T19:51:39.0786003Z 2025-05-07T19:51:39.0787472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0789934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0790965Z ^ 2025-05-07T19:51:39.0791195Z 2025-05-07T19:51:39.0791569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.0792110Z 2025-05-07T19:51:39.0793400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.0795539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.0796631Z ^ 2025-05-07T19:51:39.0796995Z 2025-05-07T19:51:39.5880797Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:39.8682046Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:39.8702840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8705370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8706521Z ^ 2025-05-07T19:51:39.8706789Z 2025-05-07T19:51:39.8707183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8707768Z 2025-05-07T19:51:39.8709336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8711848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8712964Z ^ 
2025-05-07T19:51:39.8713273Z 2025-05-07T19:51:39.8714800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8717261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8718387Z ^ 2025-05-07T19:51:39.8718627Z 2025-05-07T19:51:39.8719051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8719672Z 2025-05-07T19:51:39.8721185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8723702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8724823Z ^ 2025-05-07T19:51:39.8725191Z 2025-05-07T19:51:39.8726673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8729082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8730187Z ^ 2025-05-07T19:51:39.8730451Z 2025-05-07T19:51:39.8730881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8731518Z 2025-05-07T19:51:39.8733119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:51:39.8735593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8737130Z ^ 2025-05-07T19:51:39.8737485Z 2025-05-07T19:51:39.8739024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8741425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8742483Z ^ 2025-05-07T19:51:39.8742938Z 2025-05-07T19:51:39.8743375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8744025Z 2025-05-07T19:51:39.8745518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8747946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8749052Z ^ 2025-05-07T19:51:39.8749402Z 2025-05-07T19:51:39.8750963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8753592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8754711Z ^ 2025-05-07T19:51:39.8755007Z 2025-05-07T19:51:39.8755453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8756105Z 2025-05-07T19:51:39.8757729Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8760097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8761189Z ^ 2025-05-07T19:51:39.8761513Z 2025-05-07T19:51:41.2211249Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:41.2232415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2235175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2236348Z ^ 2025-05-07T19:51:41.2236630Z 2025-05-07T19:51:41.2237057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.2237736Z 2025-05-07T19:51:41.2239347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2241964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2243157Z ^ 2025-05-07T19:51:41.2243521Z 2025-05-07T19:51:41.2245158Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2247764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2248979Z ^ 2025-05-07T19:51:41.2249239Z 2025-05-07T19:51:41.2249711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.2250344Z 2025-05-07T19:51:41.2251969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2254622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2255765Z ^ 2025-05-07T19:51:41.2256175Z 2025-05-07T19:51:41.2257918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2260518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2261620Z ^ 2025-05-07T19:51:41.2261885Z 2025-05-07T19:51:41.2262330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.2262942Z 2025-05-07T19:51:41.2264554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2267133Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2268561Z ^ 2025-05-07T19:51:41.2268918Z 2025-05-07T19:51:41.2270907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2273753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2274931Z ^ 2025-05-07T19:51:41.2275186Z 2025-05-07T19:51:41.2275635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.2276332Z 2025-05-07T19:51:41.2277962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2280671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2281835Z ^ 2025-05-07T19:51:41.2282233Z 2025-05-07T19:51:41.2283853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2286534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2287706Z ^ 2025-05-07T19:51:41.2287957Z 2025-05-07T19:51:41.2288432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.2289105Z 2025-05-07T19:51:41.2290736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.2293389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.2294575Z ^ 2025-05-07T19:51:41.2294943Z 2025-05-07T19:51:49.3393327Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:49.3414341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3417061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3418110Z ^ 2025-05-07T19:51:49.3418341Z 2025-05-07T19:51:49.3418737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:49.3419365Z 2025-05-07T19:51:49.3420866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3423399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3424507Z ^ 2025-05-07T19:51:49.3424885Z 
2025-05-07T19:51:49.3426394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3428800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3429889Z ^ 2025-05-07T19:51:49.3430147Z 2025-05-07T19:51:49.3430583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:49.3431203Z 2025-05-07T19:51:49.3432777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3435315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3436398Z ^ 2025-05-07T19:51:49.3436734Z 2025-05-07T19:51:49.3438257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3440743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3441844Z ^ 2025-05-07T19:51:49.3442082Z 2025-05-07T19:51:49.3442484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:49.3443092Z 2025-05-07T19:51:49.3444573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3447458Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3448526Z ^ 2025-05-07T19:51:49.3448889Z 2025-05-07T19:51:49.3450596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3453006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3454061Z ^ 2025-05-07T19:51:49.3454323Z 2025-05-07T19:51:49.3454749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:49.3455377Z 2025-05-07T19:51:49.3457106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3459606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3460683Z ^ 2025-05-07T19:51:49.3461025Z 2025-05-07T19:51:49.3462516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3465034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3466123Z ^ 2025-05-07T19:51:49.3466364Z 2025-05-07T19:51:49.3466794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:49.3467405Z 2025-05-07T19:51:49.3468904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:49.3471645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:49.3472779Z ^ 2025-05-07T19:51:49.3473145Z 2025-05-07T19:51:50.7515495Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:50.7536065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7538808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7539892Z ^ 2025-05-07T19:51:50.7540159Z 2025-05-07T19:51:50.7540611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.7541239Z 2025-05-07T19:51:50.7542802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7545255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7546364Z ^ 2025-05-07T19:51:50.7546694Z 
2025-05-07T19:51:50.7548148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7550461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7551457Z ^ 2025-05-07T19:51:50.7551655Z 2025-05-07T19:51:50.7552054Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.7552655Z 2025-05-07T19:51:50.7554268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7556901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7558032Z ^ 2025-05-07T19:51:50.7558392Z 2025-05-07T19:51:50.7559985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7562557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7563666Z ^ 2025-05-07T19:51:50.7563919Z 2025-05-07T19:51:50.7564347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.7564982Z 2025-05-07T19:51:50.7566887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7569172Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7570496Z ^ 2025-05-07T19:51:50.7570841Z 2025-05-07T19:51:50.7572755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7575054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7576182Z ^ 2025-05-07T19:51:50.7576537Z 2025-05-07T19:51:50.7576941Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.7577523Z 2025-05-07T19:51:50.7578830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7581311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7582432Z ^ 2025-05-07T19:51:50.7582778Z 2025-05-07T19:51:50.7584344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7586886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7587984Z ^ 2025-05-07T19:51:50.7588225Z 2025-05-07T19:51:50.7588668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:50.7589299Z 2025-05-07T19:51:50.7590888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:50.7593392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:50.7594472Z ^ 2025-05-07T19:51:50.7594784Z 2025-05-07T19:51:55.2196366Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:55.2216204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2218774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2219782Z ^ 2025-05-07T19:51:55.2220015Z 2025-05-07T19:51:55.2220405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.2220994Z 2025-05-07T19:51:55.2222365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2224627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2225667Z ^ 2025-05-07T19:51:55.2225999Z 2025-05-07T19:51:55.2227445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2229685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2230695Z ^ 2025-05-07T19:51:55.2230916Z 2025-05-07T19:51:55.2231304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.2231954Z 2025-05-07T19:51:55.2233362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2235531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2236530Z ^ 2025-05-07T19:51:55.2236857Z 2025-05-07T19:51:55.2238284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2240557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2241854Z ^ 2025-05-07T19:51:55.2242064Z 2025-05-07T19:51:55.2242450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.2243015Z 2025-05-07T19:51:55.2244375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2246833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2247893Z ^ 2025-05-07T19:51:55.2248208Z 2025-05-07T19:51:55.2249623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2251889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2252852Z ^ 2025-05-07T19:51:55.2253079Z 2025-05-07T19:51:55.2253500Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.2254031Z 2025-05-07T19:51:55.2255463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2257713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2258720Z ^ 2025-05-07T19:51:55.2259022Z 2025-05-07T19:51:55.2260353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2262585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2263498Z ^ 2025-05-07T19:51:55.2263693Z 2025-05-07T19:51:55.2264051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.2264661Z 2025-05-07T19:51:55.2266034Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.2268368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.2269401Z ^ 2025-05-07T19:51:55.2269733Z 2025-05-07T19:51:55.6218718Z [117/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:55.6234574Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:56.2656771Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:56.9053502Z [119/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:51:58.1085000Z [120/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:51:58.1102122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1104253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1105177Z ^ 2025-05-07T19:51:58.1105388Z 2025-05-07T19:51:58.1105739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1106246Z 2025-05-07T19:51:58.1107563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1109620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1110533Z ^ 2025-05-07T19:51:58.1110830Z 2025-05-07T19:51:58.1112078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1114055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:58.1114927Z ^ 2025-05-07T19:51:58.1115136Z 2025-05-07T19:51:58.1115471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1115990Z 2025-05-07T19:51:58.1117279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1119363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1120295Z ^ 2025-05-07T19:51:58.1120589Z 2025-05-07T19:51:58.1121858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1123888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1124783Z ^ 2025-05-07T19:51:58.1124984Z 2025-05-07T19:51:58.1125334Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1125876Z 2025-05-07T19:51:58.1127154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1129215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1130367Z ^ 2025-05-07T19:51:58.1130660Z 2025-05-07T19:51:58.1131960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:58.1134015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1134950Z ^ 2025-05-07T19:51:58.1135143Z 2025-05-07T19:51:58.1135673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1136192Z 2025-05-07T19:51:58.1137567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1139632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1140598Z ^ 2025-05-07T19:51:58.1140883Z 2025-05-07T19:51:58.1142114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1144160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1145098Z ^ 2025-05-07T19:51:58.1145308Z 2025-05-07T19:51:58.1145689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1146205Z 2025-05-07T19:51:58.1147476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1149487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1150388Z ^ 2025-05-07T19:51:58.1150670Z 2025-05-07T19:52:01.5941952Z [121/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:01.5965755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.5968417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.5969565Z ^ 2025-05-07T19:52:01.5969823Z 2025-05-07T19:52:01.5970530Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:01.5971224Z 2025-05-07T19:52:01.5972847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.5975494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.5976818Z ^ 2025-05-07T19:52:01.5977181Z 2025-05-07T19:52:01.5978805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:52:01.5981423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.5982647Z ^ 2025-05-07T19:52:01.5982897Z 2025-05-07T19:52:01.5983348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:01.5983985Z 2025-05-07T19:52:01.5985658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.5988322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.5989502Z ^ 2025-05-07T19:52:01.5989874Z 2025-05-07T19:52:01.5991502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.5994153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.5995316Z ^ 2025-05-07T19:52:01.5995567Z 2025-05-07T19:52:01.5996027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:01.5996662Z 2025-05-07T19:52:01.5998345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.6001293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.6002483Z ^ 2025-05-07T19:52:01.6002817Z 2025-05-07T19:52:01.6004238Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.6006336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.6007283Z ^ 2025-05-07T19:52:01.6007492Z 2025-05-07T19:52:01.6007855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:01.6008428Z 2025-05-07T19:52:01.6009850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.6012199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.6013235Z ^ 2025-05-07T19:52:01.6013549Z 2025-05-07T19:52:01.6014938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.6017410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.6018527Z ^ 2025-05-07T19:52:01.6018768Z 2025-05-07T19:52:01.6019215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:01.6019867Z 2025-05-07T19:52:01.6021521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:01.6024213Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:01.6025396Z ^ 2025-05-07T19:52:01.6025750Z 2025-05-07T19:52:04.7002543Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:04.7024055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7026577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7027749Z ^ 2025-05-07T19:52:04.7027991Z 2025-05-07T19:52:04.7028388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.7029001Z 2025-05-07T19:52:04.7030489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7033043Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7034075Z ^ 2025-05-07T19:52:04.7034446Z 2025-05-07T19:52:04.7035956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7038433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7039488Z ^ 2025-05-07T19:52:04.7039709Z 2025-05-07T19:52:04.7040154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.7040770Z 2025-05-07T19:52:04.7042310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7044764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7045850Z ^ 2025-05-07T19:52:04.7046211Z 2025-05-07T19:52:04.7047756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7050184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7051535Z ^ 2025-05-07T19:52:04.7051785Z 2025-05-07T19:52:04.7052175Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.7052786Z 2025-05-07T19:52:04.7054320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7057143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7058214Z ^ 2025-05-07T19:52:04.7058558Z 2025-05-07T19:52:04.7060076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7062491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7063577Z ^ 2025-05-07T19:52:04.7063803Z 2025-05-07T19:52:04.7064229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.7064841Z 2025-05-07T19:52:04.7066435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7068906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7069971Z ^ 2025-05-07T19:52:04.7070581Z 2025-05-07T19:52:04.7072019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7074469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7075534Z ^ 2025-05-07T19:52:04.7075770Z 2025-05-07T19:52:04.7076183Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:52:04.7076777Z 2025-05-07T19:52:04.7078325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.7080771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.7081866Z ^ 2025-05-07T19:52:04.7082166Z 2025-05-07T19:52:05.4436277Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:05.4455836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4458331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4459326Z ^ 2025-05-07T19:52:05.4459534Z 2025-05-07T19:52:05.4459967Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.4460559Z 2025-05-07T19:52:05.4462086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4464661Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4465812Z ^ 
2025-05-07T19:52:05.4466161Z 2025-05-07T19:52:05.4467722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4469957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4471337Z ^ 2025-05-07T19:52:05.4471577Z 2025-05-07T19:52:05.4471995Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.4472632Z 2025-05-07T19:52:05.4474275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4476653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4477665Z ^ 2025-05-07T19:52:05.4477982Z 2025-05-07T19:52:05.4479355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4481705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4483234Z ^ 2025-05-07T19:52:05.4483483Z 2025-05-07T19:52:05.4483930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.4484594Z 2025-05-07T19:52:05.4485990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:05.4488297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4500380Z ^ 2025-05-07T19:52:05.4500831Z 2025-05-07T19:52:05.4502347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4504557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4505591Z ^ 2025-05-07T19:52:05.4505856Z 2025-05-07T19:52:05.4506300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.4506937Z 2025-05-07T19:52:05.4508380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4510721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4511572Z ^ 2025-05-07T19:52:05.4511823Z 2025-05-07T19:52:05.4513007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4515223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4516249Z ^ 2025-05-07T19:52:05.4516484Z 2025-05-07T19:52:05.4516888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.4517456Z 2025-05-07T19:52:05.4518889Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.4521328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.4522401Z ^ 2025-05-07T19:52:05.4522762Z 2025-05-07T19:52:05.6096655Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:05.6107859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6109215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6109815Z ^ 2025-05-07T19:52:05.6109974Z 2025-05-07T19:52:05.6110209Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.6110555Z 2025-05-07T19:52:05.6111399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6112732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6113357Z 
^ 2025-05-07T19:52:05.6113551Z 2025-05-07T19:52:05.6114383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6115699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6116302Z ^ 2025-05-07T19:52:05.6116439Z 2025-05-07T19:52:05.6116672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.6117032Z 2025-05-07T19:52:05.6117857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6119204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6119802Z ^ 2025-05-07T19:52:05.6120005Z 2025-05-07T19:52:05.6120827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6122252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6122838Z ^ 2025-05-07T19:52:05.6122990Z 2025-05-07T19:52:05.6123220Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.6123558Z 2025-05-07T19:52:05.6124449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:05.6125795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6126411Z ^ 2025-05-07T19:52:05.6126603Z 2025-05-07T19:52:05.6127426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6128760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6129363Z ^ 2025-05-07T19:52:05.6129499Z 2025-05-07T19:52:05.6129729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.6130088Z 2025-05-07T19:52:05.6130912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6132260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6132864Z ^ 2025-05-07T19:52:05.6133057Z 2025-05-07T19:52:05.6133894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6135213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6135810Z ^ 2025-05-07T19:52:05.6135949Z 2025-05-07T19:52:05.6136193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.6136665Z 2025-05-07T19:52:05.6137490Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.6138838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.6139454Z ^ 2025-05-07T19:52:05.6139642Z 2025-05-07T19:52:11.3565219Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:11.3588427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3591025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3592165Z ^ 2025-05-07T19:52:11.3592408Z 2025-05-07T19:52:11.3592857Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.3593505Z 2025-05-07T19:52:11.3595130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:11.3597733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3598881Z ^ 2025-05-07T19:52:11.3599240Z 2025-05-07T19:52:11.3600840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3603407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3604563Z ^ 2025-05-07T19:52:11.3604811Z 2025-05-07T19:52:11.3605261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.3605923Z 2025-05-07T19:52:11.3607632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3610260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3611692Z ^ 2025-05-07T19:52:11.3612062Z 2025-05-07T19:52:11.3613634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3616195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3617560Z ^ 2025-05-07T19:52:11.3617801Z 2025-05-07T19:52:11.3618241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.3618851Z 2025-05-07T19:52:11.3620450Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3623102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3624268Z ^ 2025-05-07T19:52:11.3624626Z 2025-05-07T19:52:11.3626232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3628852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3629986Z ^ 2025-05-07T19:52:11.3630225Z 2025-05-07T19:52:11.3630654Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.3631293Z 2025-05-07T19:52:11.3632890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3635562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3636747Z ^ 2025-05-07T19:52:11.3637115Z 2025-05-07T19:52:11.3638767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3641384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3642570Z ^ 
2025-05-07T19:52:11.3642827Z 2025-05-07T19:52:11.3643303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.3643977Z 2025-05-07T19:52:11.3645643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.3648337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.3649551Z ^ 2025-05-07T19:52:11.3649927Z 2025-05-07T19:52:35.7588463Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:35.7610468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7613095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7614161Z ^ 2025-05-07T19:52:35.7614418Z 2025-05-07T19:52:35.7614869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:35.7615493Z 2025-05-07T19:52:35.7617245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7619765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7620916Z ^ 2025-05-07T19:52:35.7621253Z 2025-05-07T19:52:35.7622803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7625293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7626398Z ^ 2025-05-07T19:52:35.7626639Z 2025-05-07T19:52:35.7627063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:35.7627709Z 2025-05-07T19:52:35.7629283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7632144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7633265Z ^ 2025-05-07T19:52:35.7633628Z 2025-05-07T19:52:35.7635330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7637857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7638955Z ^ 2025-05-07T19:52:35.7639191Z 2025-05-07T19:52:35.7639618Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:52:35.7640252Z 2025-05-07T19:52:35.7641727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7644062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7645157Z ^ 2025-05-07T19:52:35.7645517Z 2025-05-07T19:52:35.7646900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7649157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7650268Z ^ 2025-05-07T19:52:35.7650514Z 2025-05-07T19:52:35.7650926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:35.7651584Z 2025-05-07T19:52:35.7653151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7655702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7657129Z ^ 2025-05-07T19:52:35.7657487Z 2025-05-07T19:52:35.7659102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7661715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7662852Z ^ 2025-05-07T19:52:35.7663105Z 2025-05-07T19:52:35.7663566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:35.7664235Z 2025-05-07T19:52:35.7665881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:35.7668345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:35.7669431Z ^ 2025-05-07T19:52:35.7669784Z 2025-05-07T19:52:36.4026904Z [127/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:37.2233863Z [128/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:37.2256282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2258767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2259815Z ^ 2025-05-07T19:52:37.2260059Z 2025-05-07T19:52:37.2260501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.2261109Z 2025-05-07T19:52:37.2262608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2265194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2266366Z ^ 2025-05-07T19:52:37.2266726Z 2025-05-07T19:52:37.2268348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2271021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2272051Z ^ 2025-05-07T19:52:37.2272290Z 2025-05-07T19:52:37.2272661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.2273203Z 2025-05-07T19:52:37.2274800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2277264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2278408Z ^ 2025-05-07T19:52:37.2278773Z 2025-05-07T19:52:37.2280901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2283306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2284356Z ^ 2025-05-07T19:52:37.2284568Z 2025-05-07T19:52:37.2284962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.2285860Z 2025-05-07T19:52:37.2287389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2289954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2291043Z ^ 2025-05-07T19:52:37.2291408Z 2025-05-07T19:52:37.2293219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2295762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2297026Z ^ 2025-05-07T19:52:37.2297295Z 2025-05-07T19:52:37.2297693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.2298326Z 2025-05-07T19:52:37.2299900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2302163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2303179Z ^ 2025-05-07T19:52:37.2303503Z 2025-05-07T19:52:37.2304906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2307263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2308269Z ^ 2025-05-07T19:52:37.2308493Z 2025-05-07T19:52:37.2308922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.2309580Z 2025-05-07T19:52:37.2311116Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.2313687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.2314685Z ^ 2025-05-07T19:52:37.2315031Z 2025-05-07T19:52:37.7865093Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:38.7291480Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:38.7311158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7313899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7314952Z ^ 2025-05-07T19:52:38.7315182Z 2025-05-07T19:52:38.7315594Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:38.7316181Z 2025-05-07T19:52:38.7317691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7320112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7321129Z ^ 2025-05-07T19:52:38.7321461Z 2025-05-07T19:52:38.7322933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7325278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7326316Z ^ 2025-05-07T19:52:38.7326548Z 2025-05-07T19:52:38.7326949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:38.7327574Z 2025-05-07T19:52:38.7329059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7331464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7332504Z ^ 2025-05-07T19:52:38.7332850Z 2025-05-07T19:52:38.7334318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7336798Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7337832Z ^ 2025-05-07T19:52:38.7338054Z 2025-05-07T19:52:38.7338460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:38.7339060Z 2025-05-07T19:52:38.7340538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7342907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7343964Z ^ 2025-05-07T19:52:38.7344294Z 2025-05-07T19:52:38.7345780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7348428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7349470Z ^ 2025-05-07T19:52:38.7349713Z 2025-05-07T19:52:38.7350112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:38.7350678Z 2025-05-07T19:52:38.7352335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7354749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7355819Z ^ 2025-05-07T19:52:38.7356138Z 2025-05-07T19:52:38.7357649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7360005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7361066Z ^ 2025-05-07T19:52:38.7361290Z 2025-05-07T19:52:38.7361692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:38.7362301Z 2025-05-07T19:52:38.7363771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:38.7366238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:38.7367283Z ^ 2025-05-07T19:52:38.7367626Z 2025-05-07T19:52:48.2043777Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:48.2060541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2062557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2063403Z ^ 2025-05-07T19:52:48.2063661Z 2025-05-07T19:52:48.2064018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:48.2064534Z 
2025-05-07T19:52:48.2065710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2067678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2068551Z ^ 2025-05-07T19:52:48.2068831Z 2025-05-07T19:52:48.2070036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2072177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2073050Z ^ 2025-05-07T19:52:48.2073259Z 2025-05-07T19:52:48.2073593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:48.2074074Z 2025-05-07T19:52:48.2075259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2077162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2078011Z ^ 2025-05-07T19:52:48.2078286Z 2025-05-07T19:52:48.2079502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2081465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:52:48.2082356Z ^ 2025-05-07T19:52:48.2082562Z 2025-05-07T19:52:48.2082933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:48.2083384Z 2025-05-07T19:52:48.2084597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2086553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2087456Z ^ 2025-05-07T19:52:48.2087744Z 2025-05-07T19:52:48.2088932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2091231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2092086Z ^ 2025-05-07T19:52:48.2092284Z 2025-05-07T19:52:48.2092623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:48.2093131Z 2025-05-07T19:52:48.2094561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2096612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2097497Z ^ 2025-05-07T19:52:48.2097772Z 2025-05-07T19:52:48.2098967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:52:48.2100827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2101694Z ^ 2025-05-07T19:52:48.2101870Z 2025-05-07T19:52:48.2102183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:48.2102709Z 2025-05-07T19:52:48.2103929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:48.2105889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:48.2106773Z ^ 2025-05-07T19:52:48.2107051Z 2025-05-07T19:52:52.2863774Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:52.2888814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2891594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2892933Z ^ 2025-05-07T19:52:52.2893231Z 2025-05-07T19:52:52.2893632Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.2894353Z 2025-05-07T19:52:52.2896140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2899226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2900427Z ^ 2025-05-07T19:52:52.2900799Z 2025-05-07T19:52:52.2902549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2905423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2906687Z ^ 2025-05-07T19:52:52.2906959Z 2025-05-07T19:52:52.2907433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.2908163Z 2025-05-07T19:52:52.2910028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2912789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2914064Z ^ 2025-05-07T19:52:52.2914467Z 2025-05-07T19:52:52.2916234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2919095Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2920264Z ^ 2025-05-07T19:52:52.2920504Z 2025-05-07T19:52:52.2920986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.2921731Z 2025-05-07T19:52:52.2923505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2926335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2927998Z ^ 2025-05-07T19:52:52.2928391Z 2025-05-07T19:52:52.2930155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2933032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2934311Z ^ 2025-05-07T19:52:52.2934825Z 2025-05-07T19:52:52.2935306Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.2936043Z 2025-05-07T19:52:52.2937953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2940849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2942120Z ^ 2025-05-07T19:52:52.2942513Z 2025-05-07T19:52:52.2944274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2947139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2948434Z ^ 2025-05-07T19:52:52.2948720Z 2025-05-07T19:52:52.2949229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.2949957Z 2025-05-07T19:52:52.2951753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.2954690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.2956028Z ^ 2025-05-07T19:52:52.2956426Z 2025-05-07T19:52:55.7534381Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:55.7556487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7559139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:52:55.7560223Z ^ 2025-05-07T19:52:55.7560465Z 2025-05-07T19:52:55.7560904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.7561563Z 2025-05-07T19:52:55.7563160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7565696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7566858Z ^ 2025-05-07T19:52:55.7567221Z 2025-05-07T19:52:55.7568783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7571520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7572624Z ^ 2025-05-07T19:52:55.7572909Z 2025-05-07T19:52:55.7573350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.7573989Z 2025-05-07T19:52:55.7575525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7578185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7579340Z ^ 2025-05-07T19:52:55.7579712Z 2025-05-07T19:52:55.7581210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:52:55.7583764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7584897Z ^ 2025-05-07T19:52:55.7585143Z 2025-05-07T19:52:55.7585577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.7586181Z 2025-05-07T19:52:55.7587762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7590705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7591772Z ^ 2025-05-07T19:52:55.7592127Z 2025-05-07T19:52:55.7593870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7596346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7597474Z ^ 2025-05-07T19:52:55.7597731Z 2025-05-07T19:52:55.7598188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.7598814Z 2025-05-07T19:52:55.7600398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7602905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7603973Z ^ 2025-05-07T19:52:55.7604250Z 2025-05-07T19:52:55.7605543Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7608485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7609607Z ^ 2025-05-07T19:52:55.7609820Z 2025-05-07T19:52:55.7610153Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.7610664Z 2025-05-07T19:52:55.7611985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.7614206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.7615289Z ^ 2025-05-07T19:52:55.7615635Z 2025-05-07T19:52:56.5993552Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:56.6015237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6017921Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6018965Z ^ 2025-05-07T19:52:56.6019181Z 2025-05-07T19:52:56.6019581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.6020125Z 2025-05-07T19:52:56.6021664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6024249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6025439Z ^ 2025-05-07T19:52:56.6025799Z 2025-05-07T19:52:56.6027368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6029888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6031037Z ^ 2025-05-07T19:52:56.6031299Z 2025-05-07T19:52:56.6031778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.6032502Z 2025-05-07T19:52:56.6034145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6036790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6037814Z ^ 2025-05-07T19:52:56.6038137Z 2025-05-07T19:52:56.6039402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6041509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6042454Z ^ 2025-05-07T19:52:56.6043086Z 2025-05-07T19:52:56.6043483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.6044045Z 2025-05-07T19:52:56.6045488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6047911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6049338Z ^ 2025-05-07T19:52:56.6049686Z 2025-05-07T19:52:56.6051113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6053411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6054379Z ^ 2025-05-07T19:52:56.6054582Z 2025-05-07T19:52:56.6054964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.6055615Z 2025-05-07T19:52:56.6057327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6059803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:52:56.6060807Z ^ 2025-05-07T19:52:56.6061119Z 2025-05-07T19:52:56.6062582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6064886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6065992Z ^ 2025-05-07T19:52:56.6066250Z 2025-05-07T19:52:56.6066710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:56.6067366Z 2025-05-07T19:52:56.6069003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:56.6071511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:56.6072540Z ^ 2025-05-07T19:52:56.6072898Z 2025-05-07T19:52:58.6984058Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:58.7006438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:52:58.7009127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7010318Z ^ 2025-05-07T19:52:58.7010617Z 2025-05-07T19:52:58.7011111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.7011798Z 2025-05-07T19:52:58.7013306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7015614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7016875Z ^ 2025-05-07T19:52:58.7017221Z 2025-05-07T19:52:58.7018646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7020840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7021912Z ^ 2025-05-07T19:52:58.7022186Z 2025-05-07T19:52:58.7022600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.7023197Z 2025-05-07T19:52:58.7024658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7027059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7028157Z ^ 2025-05-07T19:52:58.7028496Z 2025-05-07T19:52:58.7029983Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7032728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7033788Z ^ 2025-05-07T19:52:58.7034030Z 2025-05-07T19:52:58.7034441Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.7035080Z 2025-05-07T19:52:58.7036748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7039218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7040286Z ^ 2025-05-07T19:52:58.7040659Z 2025-05-07T19:52:58.7042108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7044660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7045733Z ^ 2025-05-07T19:52:58.7045948Z 2025-05-07T19:52:58.7046326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.7046915Z 2025-05-07T19:52:58.7048450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7050975Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7052102Z ^ 2025-05-07T19:52:58.7052434Z 2025-05-07T19:52:58.7053808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7056200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7057426Z ^ 2025-05-07T19:52:58.7057689Z 2025-05-07T19:52:58.7058101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.7058715Z 2025-05-07T19:52:58.7060180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.7062460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.7063415Z ^ 2025-05-07T19:52:58.7063761Z 2025-05-07T19:53:04.6128697Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:04.6150775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6153220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6154322Z ^ 2025-05-07T19:53:04.6154524Z 2025-05-07T19:53:04.6154918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.6155514Z 2025-05-07T19:53:04.6157094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6159650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6160752Z ^ 2025-05-07T19:53:04.6161128Z 2025-05-07T19:53:04.6162625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6165002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6166089Z ^ 2025-05-07T19:53:04.6166359Z 2025-05-07T19:53:04.6166799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.6167470Z 2025-05-07T19:53:04.6169199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6172282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:04.6173850Z ^ 2025-05-07T19:53:04.6174175Z 2025-05-07T19:53:04.6175805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6178527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6179646Z ^ 2025-05-07T19:53:04.6179896Z 2025-05-07T19:53:04.6180602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.6181231Z 2025-05-07T19:53:04.6182747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6185232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6186330Z ^ 2025-05-07T19:53:04.6186708Z 2025-05-07T19:53:04.6188301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6190970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6192106Z ^ 2025-05-07T19:53:04.6192360Z 2025-05-07T19:53:04.6192794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.6193445Z 2025-05-07T19:53:04.6195121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:04.6197763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6198908Z ^ 2025-05-07T19:53:04.6199268Z 2025-05-07T19:53:04.6200858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6203436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6204566Z ^ 2025-05-07T19:53:04.6204811Z 2025-05-07T19:53:04.6205220Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.6205896Z 2025-05-07T19:53:04.6207501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.6210068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.6211237Z ^ 2025-05-07T19:53:04.6211602Z 2025-05-07T19:53:08.9917462Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:08.9942750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9945339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:08.9946499Z ^ 2025-05-07T19:53:08.9946740Z 2025-05-07T19:53:08.9947166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:08.9947778Z 2025-05-07T19:53:08.9949368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9951917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:08.9952964Z ^ 2025-05-07T19:53:08.9953286Z 2025-05-07T19:53:08.9954720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9956518Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9957014Z ^ 2025-05-07T19:53:08.9957296Z 2025-05-07T19:53:08.9958727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9960956Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9961520Z ^ 2025-05-07T19:53:08.9961834Z 
2025-05-07T19:53:08.9963444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9965283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9965842Z ^ 2025-05-07T19:53:08.9966143Z 2025-05-07T19:53:08.9967983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9970744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:08.9971824Z ^ 2025-05-07T19:53:08.9972072Z 2025-05-07T19:53:08.9972505Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:08.9973109Z 2025-05-07T19:53:08.9974513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9977289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:08.9978472Z ^ 2025-05-07T19:53:08.9978831Z 2025-05-07T19:53:08.9980420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9982428Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9982979Z ^ 2025-05-07T19:53:08.9983293Z 2025-05-07T19:53:08.9984876Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9986874Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9987415Z ^ 2025-05-07T19:53:08.9987722Z 2025-05-07T19:53:08.9989212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:08.9991214Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:08.9991725Z ^ 2025-05-07T19:53:08.9991992Z 2025-05-07T19:53:08.9993513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9996130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:08.9997205Z ^ 2025-05-07T19:53:08.9997416Z 2025-05-07T19:53:08.9997787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:08.9998417Z 2025-05-07T19:53:09.0000004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.0002415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:09.0004027Z ^ 2025-05-07T19:53:09.0004321Z 2025-05-07T19:53:09.0005786Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0007517Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0008080Z ^ 2025-05-07T19:53:09.0008360Z 2025-05-07T19:53:09.0010142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0012159Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0012687Z ^ 2025-05-07T19:53:09.0012960Z 2025-05-07T19:53:09.0014457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0016471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0017049Z ^ 2025-05-07T19:53:09.0017305Z 2025-05-07T19:53:09.0018905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.0021433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:09.0022524Z ^ 2025-05-07T19:53:09.0022759Z 2025-05-07T19:53:09.0023171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:09.0023807Z 2025-05-07T19:53:09.0025381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:09.0027911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:09.0029017Z ^ 2025-05-07T19:53:09.0029375Z 2025-05-07T19:53:09.0030865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0032750Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0033278Z ^ 2025-05-07T19:53:09.0033574Z 2025-05-07T19:53:09.0035054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0036979Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0037500Z ^ 2025-05-07T19:53:09.0037786Z 2025-05-07T19:53:09.0039290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0041061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0041597Z ^ 2025-05-07T19:53:09.0041877Z 2025-05-07T19:53:09.0043405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.0046291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:09.0047426Z ^ 2025-05-07T19:53:09.0047660Z 2025-05-07T19:53:09.0048112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:09.0048734Z 
2025-05-07T19:53:09.0050413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.0052940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:09.0054057Z ^ 2025-05-07T19:53:09.0054422Z 2025-05-07T19:53:09.0055896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0058090Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0058527Z ^ 2025-05-07T19:53:09.0058798Z 2025-05-07T19:53:09.0060334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0062256Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0062820Z ^ 2025-05-07T19:53:09.0063118Z 2025-05-07T19:53:09.0064692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:09.0066697Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:09.0067260Z ^ 2025-05-07T19:53:09.0067547Z 2025-05-07T19:53:11.0341928Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:11.0366020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0368615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0369763Z ^ 2025-05-07T19:53:11.0370019Z 2025-05-07T19:53:11.0370752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.0371398Z 2025-05-07T19:53:11.0372952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0375612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0376809Z ^ 2025-05-07T19:53:11.0377186Z 2025-05-07T19:53:11.0378758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0381503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0382667Z ^ 2025-05-07T19:53:11.0382924Z 2025-05-07T19:53:11.0383361Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:11.0384020Z 2025-05-07T19:53:11.0385716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0388368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0389495Z ^ 2025-05-07T19:53:11.0389861Z 2025-05-07T19:53:11.0391472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0394108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0395258Z ^ 2025-05-07T19:53:11.0395508Z 2025-05-07T19:53:11.0395954Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.0396616Z 2025-05-07T19:53:11.0398246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0400905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0402393Z ^ 2025-05-07T19:53:11.0402766Z 2025-05-07T19:53:11.0404382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0407013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0408426Z ^ 2025-05-07T19:53:11.0408701Z 2025-05-07T19:53:11.0409157Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.0409821Z 2025-05-07T19:53:11.0411517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0414263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0415420Z ^ 2025-05-07T19:53:11.0415781Z 2025-05-07T19:53:11.0417570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0420083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0421139Z ^ 2025-05-07T19:53:11.0421394Z 2025-05-07T19:53:11.0421838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.0422529Z 2025-05-07T19:53:11.0424226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.0426968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.0428073Z ^ 2025-05-07T19:53:11.0428363Z 2025-05-07T19:53:16.9159661Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:16.9180956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9183499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9184620Z ^ 2025-05-07T19:53:16.9184836Z 2025-05-07T19:53:16.9185268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:16.9185886Z 2025-05-07T19:53:16.9187413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9189715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9190627Z ^ 2025-05-07T19:53:16.9190907Z 2025-05-07T19:53:16.9192158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9194555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:16.9195600Z ^ 2025-05-07T19:53:16.9195840Z 2025-05-07T19:53:16.9196230Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:16.9196781Z 2025-05-07T19:53:16.9198150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9200329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9201193Z ^ 2025-05-07T19:53:16.9201447Z 2025-05-07T19:53:16.9202583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9204674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9205593Z ^ 2025-05-07T19:53:16.9205803Z 2025-05-07T19:53:16.9206159Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:16.9206753Z 2025-05-07T19:53:16.9208213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9210782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9211782Z ^ 2025-05-07T19:53:16.9212058Z 2025-05-07T19:53:16.9213633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:16.9215966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9217060Z ^ 2025-05-07T19:53:16.9217281Z 2025-05-07T19:53:16.9217651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:16.9218191Z 2025-05-07T19:53:16.9219615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9222139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9223280Z ^ 2025-05-07T19:53:16.9223618Z 2025-05-07T19:53:16.9225133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9227307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9228409Z ^ 2025-05-07T19:53:16.9228641Z 2025-05-07T19:53:16.9229059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:16.9229657Z 2025-05-07T19:53:16.9231107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9233464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:16.9234489Z ^ 2025-05-07T19:53:16.9234820Z 2025-05-07T19:53:17.6862446Z [140/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:17.6885881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6888529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6889662Z ^ 2025-05-07T19:53:17.6889914Z 2025-05-07T19:53:17.6890351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:17.6891009Z 2025-05-07T19:53:17.6892668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6895379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6896589Z ^ 2025-05-07T19:53:17.6896944Z 2025-05-07T19:53:17.6898436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:53:17.6900492Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:17.6901185Z ^ 2025-05-07T19:53:17.6901459Z 2025-05-07T19:53:17.6902957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6904845Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6905394Z ^ 2025-05-07T19:53:17.6905675Z 2025-05-07T19:53:17.6907162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6908925Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6909435Z ^ 2025-05-07T19:53:17.6909701Z 2025-05-07T19:53:17.6911169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6913364Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6913882Z ^ 2025-05-07T19:53:17.6914141Z 2025-05-07T19:53:17.6915708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6918220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6919536Z ^ 2025-05-07T19:53:17.6919776Z 2025-05-07T19:53:17.6920163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:17.6920721Z 2025-05-07T19:53:17.6922156Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6924698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6925860Z ^ 2025-05-07T19:53:17.6926209Z 2025-05-07T19:53:17.6927728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6929816Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:17.6930534Z ^ 2025-05-07T19:53:17.6930811Z 2025-05-07T19:53:17.6932322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6934242Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6934779Z ^ 2025-05-07T19:53:17.6935064Z 2025-05-07T19:53:17.6936723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6938681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6939208Z ^ 2025-05-07T19:53:17.6939470Z 2025-05-07T19:53:17.6941044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6942926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6943438Z ^ 
2025-05-07T19:53:17.6943718Z 2025-05-07T19:53:17.6945286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6947805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6948881Z ^ 2025-05-07T19:53:17.6949150Z 2025-05-07T19:53:17.6949597Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:17.6950267Z 2025-05-07T19:53:17.6951887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6954553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6955891Z ^ 2025-05-07T19:53:17.6956225Z 2025-05-07T19:53:17.6957555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6959488Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:17.6960161Z ^ 2025-05-07T19:53:17.6960440Z 2025-05-07T19:53:17.6961999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6963885Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6964362Z ^ 2025-05-07T19:53:17.6964636Z 2025-05-07T19:53:17.6966111Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6967925Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6968454Z ^ 2025-05-07T19:53:17.6968708Z 2025-05-07T19:53:17.6970454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6972167Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6972672Z ^ 2025-05-07T19:53:17.6972943Z 2025-05-07T19:53:17.6974475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6977020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6978114Z ^ 2025-05-07T19:53:17.6978343Z 2025-05-07T19:53:17.6978750Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:17.6979331Z 2025-05-07T19:53:17.6980889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.6983291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.6984477Z ^ 2025-05-07T19:53:17.6984844Z 2025-05-07T19:53:17.6986390Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6988585Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:17.6989332Z ^ 2025-05-07T19:53:17.6989632Z 2025-05-07T19:53:17.6991225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6993209Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6993760Z ^ 2025-05-07T19:53:17.6994028Z 2025-05-07T19:53:17.6995535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.6997775Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.6998309Z ^ 2025-05-07T19:53:17.6998549Z 2025-05-07T19:53:17.7000113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.7001912Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.7002444Z ^ 2025-05-07T19:53:17.7002918Z 2025-05-07T19:53:17.7004467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.7007009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.7008157Z ^ 
2025-05-07T19:53:17.7008400Z 2025-05-07T19:53:17.7008833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:17.7009503Z 2025-05-07T19:53:17.7011136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:17.7013800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:17.7014956Z ^ 2025-05-07T19:53:17.7015332Z 2025-05-07T19:53:17.7017023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.7019110Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:17.7019821Z ^ 2025-05-07T19:53:17.7020091Z 2025-05-07T19:53:17.7021602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.7023462Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.7024019Z ^ 2025-05-07T19:53:17.7024290Z 2025-05-07T19:53:17.7025749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.7027639Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.7028062Z ^ 2025-05-07T19:53:17.7028285Z 2025-05-07T19:53:17.7029452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:17.7030929Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:17.7031366Z ^ 2025-05-07T19:53:17.7031598Z 2025-05-07T19:53:18.8226558Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:18.8251159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8253945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8255137Z ^ 2025-05-07T19:53:18.8255424Z 2025-05-07T19:53:18.8255907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.8256722Z 2025-05-07T19:53:18.8258435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8261167Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8262413Z ^ 2025-05-07T19:53:18.8262785Z 2025-05-07T19:53:18.8264334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8266162Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.8266843Z ^ 2025-05-07T19:53:18.8267109Z 2025-05-07T19:53:18.8268391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8270062Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8270837Z ^ 2025-05-07T19:53:18.8271468Z 2025-05-07T19:53:18.8272837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8274532Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8275016Z ^ 2025-05-07T19:53:18.8275291Z 2025-05-07T19:53:18.8276839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8278705Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8279248Z ^ 2025-05-07T19:53:18.8279515Z 2025-05-07T19:53:18.8281179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:53:18.8283785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8284932Z ^ 2025-05-07T19:53:18.8285180Z 2025-05-07T19:53:18.8285626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.8286253Z 2025-05-07T19:53:18.8287895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8290518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8291691Z ^ 2025-05-07T19:53:18.8292054Z 2025-05-07T19:53:18.8293541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8295642Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.8296522Z ^ 2025-05-07T19:53:18.8296839Z 2025-05-07T19:53:18.8298367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8300286Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8300815Z ^ 2025-05-07T19:53:18.8301116Z 2025-05-07T19:53:18.8302613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8304540Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8305092Z ^ 
2025-05-07T19:53:18.8305369Z 2025-05-07T19:53:18.8306916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8308819Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8309404Z ^ 2025-05-07T19:53:18.8309681Z 2025-05-07T19:53:18.8311328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8313905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8315293Z ^ 2025-05-07T19:53:18.8315543Z 2025-05-07T19:53:18.8315986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.8316661Z 2025-05-07T19:53:18.8318305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8321033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8322151Z ^ 2025-05-07T19:53:18.8322541Z 2025-05-07T19:53:18.8324052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8326152Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.8326891Z ^ 2025-05-07T19:53:18.8327179Z 2025-05-07T19:53:18.8328682Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8330587Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8331167Z ^ 2025-05-07T19:53:18.8331444Z 2025-05-07T19:53:18.8332942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8334823Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8335387Z ^ 2025-05-07T19:53:18.8335674Z 2025-05-07T19:53:18.8337304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8339226Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8339787Z ^ 2025-05-07T19:53:18.8340062Z 2025-05-07T19:53:18.8341674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8344319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8345519Z ^ 2025-05-07T19:53:18.8345782Z 2025-05-07T19:53:18.8346227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.8346910Z 2025-05-07T19:53:18.8348597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8351210Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8352420Z ^ 2025-05-07T19:53:18.8352789Z 2025-05-07T19:53:18.8354377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8356540Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.8357324Z ^ 2025-05-07T19:53:18.8357837Z 2025-05-07T19:53:18.8359422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8361390Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8361988Z ^ 2025-05-07T19:53:18.8362277Z 2025-05-07T19:53:18.8363967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8365947Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8366511Z ^ 2025-05-07T19:53:18.8366782Z 2025-05-07T19:53:18.8368285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8370536Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8371105Z ^ 2025-05-07T19:53:18.8371406Z 2025-05-07T19:53:18.8373078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:53:18.8375848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8377145Z ^ 2025-05-07T19:53:18.8377436Z 2025-05-07T19:53:18.8377894Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.8378576Z 2025-05-07T19:53:18.8380299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8383035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:18.8384264Z ^ 2025-05-07T19:53:18.8384633Z 2025-05-07T19:53:18.8386204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8388344Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.8389101Z ^ 2025-05-07T19:53:18.8389385Z 2025-05-07T19:53:18.8390935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8392885Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8393449Z ^ 2025-05-07T19:53:18.8393730Z 2025-05-07T19:53:18.8395283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8397248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8397810Z ^ 
2025-05-07T19:53:18.8409040Z 2025-05-07T19:53:18.8410715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.8412730Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.8413281Z ^ 2025-05-07T19:53:18.8413905Z 2025-05-07T19:53:20.2902799Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:20.2923678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2926187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2927345Z ^ 2025-05-07T19:53:20.2927569Z 2025-05-07T19:53:20.2928007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.2928658Z 2025-05-07T19:53:20.2930296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2932897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2934085Z ^ 2025-05-07T19:53:20.2934445Z 2025-05-07T19:53:20.2936127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2938937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2940134Z ^ 2025-05-07T19:53:20.2940742Z 2025-05-07T19:53:20.2941185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.2941815Z 2025-05-07T19:53:20.2943352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2945982Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2947306Z ^ 2025-05-07T19:53:20.2947660Z 2025-05-07T19:53:20.2949180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2951765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2952838Z ^ 2025-05-07T19:53:20.2953092Z 2025-05-07T19:53:20.2953521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.2954153Z 2025-05-07T19:53:20.2955805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2958391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:53:20.2959499Z ^ 2025-05-07T19:53:20.2959849Z 2025-05-07T19:53:20.2961484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2964114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2965220Z ^ 2025-05-07T19:53:20.2965459Z 2025-05-07T19:53:20.2965895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.2966571Z 2025-05-07T19:53:20.2968233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2971078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2972235Z ^ 2025-05-07T19:53:20.2972606Z 2025-05-07T19:53:20.2974211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.2976925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2978072Z ^ 2025-05-07T19:53:20.2978333Z 2025-05-07T19:53:20.2978765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.2979418Z 2025-05-07T19:53:20.2981119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:53:20.2983690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.2985201Z ^ 2025-05-07T19:53:20.2985567Z 2025-05-07T19:53:20.9272886Z [143/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:21.5277477Z [144/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:22.7822637Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:22.7845954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7848708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7849931Z ^ 2025-05-07T19:53:22.7850189Z 2025-05-07T19:53:22.7850657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.7851725Z 2025-05-07T19:53:22.7853437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7856169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7857558Z ^ 2025-05-07T19:53:22.7857941Z 2025-05-07T19:53:22.7859841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7862539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7863778Z ^ 2025-05-07T19:53:22.7864034Z 2025-05-07T19:53:22.7864510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.7865187Z 2025-05-07T19:53:22.7866953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7869719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:22.7871236Z ^ 2025-05-07T19:53:22.7871626Z 2025-05-07T19:53:22.7873322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7876078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7877288Z ^ 2025-05-07T19:53:22.7877547Z 2025-05-07T19:53:22.7878004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.7878698Z 2025-05-07T19:53:22.7880412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7883153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7884401Z ^ 2025-05-07T19:53:22.7884781Z 2025-05-07T19:53:22.7886473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7889217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7890435Z ^ 2025-05-07T19:53:22.7890699Z 2025-05-07T19:53:22.7891186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.7891859Z 2025-05-07T19:53:22.7893571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:22.7896426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7897648Z ^ 2025-05-07T19:53:22.7898308Z 2025-05-07T19:53:22.7899997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7902686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7903856Z ^ 2025-05-07T19:53:22.7904125Z 2025-05-07T19:53:22.7904724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.7905407Z 2025-05-07T19:53:22.7907117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.7909845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.7911061Z ^ 2025-05-07T19:53:22.7911443Z 2025-05-07T19:53:24.9173177Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:24.9199540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9202441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9205459Z ^ 2025-05-07T19:53:24.9205711Z 2025-05-07T19:53:24.9206154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:24.9206841Z 2025-05-07T19:53:24.9208488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9211375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9212570Z ^ 2025-05-07T19:53:24.9212946Z 2025-05-07T19:53:24.9214521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9217314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9218471Z ^ 2025-05-07T19:53:24.9218767Z 2025-05-07T19:53:24.9219231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:24.9219916Z 2025-05-07T19:53:24.9221628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9224323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9225415Z ^ 2025-05-07T19:53:24.9225771Z 2025-05-07T19:53:24.9227456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9230125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9231286Z ^ 2025-05-07T19:53:24.9231512Z 2025-05-07T19:53:24.9231900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:24.9232570Z 2025-05-07T19:53:24.9234148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9236798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9238001Z ^ 2025-05-07T19:53:24.9238390Z 2025-05-07T19:53:24.9240078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9242595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9243766Z ^ 2025-05-07T19:53:24.9244055Z 2025-05-07T19:53:24.9244502Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:24.9245157Z 2025-05-07T19:53:24.9246820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9249788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9250971Z ^ 2025-05-07T19:53:24.9251324Z 2025-05-07T19:53:24.9252920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9255737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9256992Z ^ 2025-05-07T19:53:24.9257239Z 2025-05-07T19:53:24.9257670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:24.9258345Z 2025-05-07T19:53:24.9259977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.9262724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:24.9263893Z ^ 2025-05-07T19:53:24.9264283Z 2025-05-07T19:53:28.0734283Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:28.0757896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0760648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0761805Z ^ 2025-05-07T19:53:28.0762162Z 2025-05-07T19:53:28.0762618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.0763826Z 2025-05-07T19:53:28.0765443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0768113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0769356Z ^ 2025-05-07T19:53:28.0769759Z 2025-05-07T19:53:28.0771709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0774358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0775555Z ^ 2025-05-07T19:53:28.0775818Z 2025-05-07T19:53:28.0776368Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:28.0777032Z 2025-05-07T19:53:28.0778688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0781247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0782390Z ^ 2025-05-07T19:53:28.0782774Z 2025-05-07T19:53:28.0784401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0787141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0788334Z ^ 2025-05-07T19:53:28.0788601Z 2025-05-07T19:53:28.0789056Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.0789717Z 2025-05-07T19:53:28.0791366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0794027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0795184Z ^ 2025-05-07T19:53:28.0795557Z 2025-05-07T19:53:28.0797112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0799743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0800945Z ^ 2025-05-07T19:53:28.0801208Z 2025-05-07T19:53:28.0801693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.0802835Z 2025-05-07T19:53:28.0804544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0807190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0808393Z ^ 2025-05-07T19:53:28.0808758Z 2025-05-07T19:53:28.0810621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0813204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0814343Z ^ 2025-05-07T19:53:28.0814624Z 2025-05-07T19:53:28.0815043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.0815673Z 2025-05-07T19:53:28.0817424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.0819999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.0821222Z ^ 2025-05-07T19:53:28.0821601Z 2025-05-07T19:53:29.8359030Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:29.8381809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8384788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8385923Z ^ 2025-05-07T19:53:29.8386184Z 2025-05-07T19:53:29.8386614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.8387278Z 2025-05-07T19:53:29.8388903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8391510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8392688Z ^ 2025-05-07T19:53:29.8393041Z 2025-05-07T19:53:29.8394647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8397226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8398343Z ^ 
2025-05-07T19:53:29.8398583Z 2025-05-07T19:53:29.8399000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.8399651Z 2025-05-07T19:53:29.8401267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8403869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8405001Z ^ 2025-05-07T19:53:29.8405352Z 2025-05-07T19:53:29.8406909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8409454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8410605Z ^ 2025-05-07T19:53:29.8410841Z 2025-05-07T19:53:29.8411210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.8411738Z 2025-05-07T19:53:29.8413086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8415611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8416829Z ^ 2025-05-07T19:53:29.8417165Z 2025-05-07T19:53:29.8418732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:29.8421321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8422730Z ^ 2025-05-07T19:53:29.8422984Z 2025-05-07T19:53:29.8423414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.8424062Z 2025-05-07T19:53:29.8425548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8428206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8429323Z ^ 2025-05-07T19:53:29.8429662Z 2025-05-07T19:53:29.8431182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8433665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8434798Z ^ 2025-05-07T19:53:29.8435049Z 2025-05-07T19:53:29.8435499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.8436137Z 2025-05-07T19:53:29.8437721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.8440241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.8441361Z ^ 2025-05-07T19:53:29.8441713Z 2025-05-07T19:53:43.9861613Z [149/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:53:43.9881689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:44.4013544Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:53:44.4036323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4039070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4040221Z ^ 2025-05-07T19:53:44.4040473Z 2025-05-07T19:53:44.4040911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:44.4041575Z 2025-05-07T19:53:44.4043137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4045726Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4046902Z ^ 2025-05-07T19:53:44.4051238Z 2025-05-07T19:53:44.4052933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4055558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4056878Z ^ 2025-05-07T19:53:44.4057146Z 2025-05-07T19:53:44.4057763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:44.4058434Z 2025-05-07T19:53:44.4060120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4062805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4063964Z ^ 2025-05-07T19:53:44.4064300Z 2025-05-07T19:53:44.4065848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4068526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4069659Z ^ 2025-05-07T19:53:44.4069910Z 2025-05-07T19:53:44.4070623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:44.4071270Z 2025-05-07T19:53:44.4072916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4075400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4076349Z ^ 2025-05-07T19:53:44.4076723Z 2025-05-07T19:53:44.4078373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4080952Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4082121Z ^ 2025-05-07T19:53:44.4082371Z 2025-05-07T19:53:44.4082845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:44.4083516Z 2025-05-07T19:53:44.4085215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4087955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4089180Z ^ 2025-05-07T19:53:44.4089551Z 2025-05-07T19:53:44.4091244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4093960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4095150Z ^ 2025-05-07T19:53:44.4095403Z 2025-05-07T19:53:44.4095849Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:44.4096962Z 2025-05-07T19:53:44.4098641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:44.4101358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:44.4102562Z ^ 2025-05-07T19:53:44.4103162Z 2025-05-07T19:53:49.6961442Z [151/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:53:49.6979777Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:53.7793305Z [152/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:53:53.7812295Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:53.8497805Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:53.8517618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8520511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8521543Z ^ 2025-05-07T19:53:53.8521782Z 2025-05-07T19:53:53.8522198Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:53.8522772Z 2025-05-07T19:53:53.8524437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8526839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8527905Z ^ 2025-05-07T19:53:53.8528258Z 2025-05-07T19:53:53.8529704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8531964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8532876Z ^ 2025-05-07T19:53:53.8533078Z 2025-05-07T19:53:53.8533427Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:53.8533948Z 2025-05-07T19:53:53.8535198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8537393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8538291Z ^ 2025-05-07T19:53:53.8538582Z 2025-05-07T19:53:53.8539921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8542286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8543344Z ^ 2025-05-07T19:53:53.8543555Z 2025-05-07T19:53:53.8543979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:53.8544543Z 2025-05-07T19:53:53.8545918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8548089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8549030Z ^ 2025-05-07T19:53:53.8549321Z 2025-05-07T19:53:53.8550646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8552779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8553766Z ^ 2025-05-07T19:53:53.8553994Z 2025-05-07T19:53:53.8554366Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:53.8554916Z 2025-05-07T19:53:53.8556289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8558664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8559649Z ^ 2025-05-07T19:53:53.8559960Z 2025-05-07T19:53:53.8561612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8564070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8565130Z ^ 2025-05-07T19:53:53.8565359Z 2025-05-07T19:53:53.8565775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:53.8566403Z 2025-05-07T19:53:53.8567939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8570848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:53.8571923Z ^ 2025-05-07T19:53:53.8572289Z 2025-05-07T19:53:58.2180112Z [154/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:53:58.2199010Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:53:59.6815100Z [155/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:53:59.6834199Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:06.4856161Z [156/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:06.4874281Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:08.2546126Z [157/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:08.2565281Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:10.5031284Z [158/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:10.5048733Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:11.8966932Z [159/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:11.8987049Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:13.6492045Z [160/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:13.6511459Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:14.5532331Z [161/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:14.5550848Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:16.9557354Z [162/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:16.9579891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9582498Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9583637Z ^ 2025-05-07T19:54:16.9583877Z 2025-05-07T19:54:16.9584325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:16.9584935Z 2025-05-07T19:54:16.9586435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9588907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9589866Z ^ 2025-05-07T19:54:16.9590184Z 2025-05-07T19:54:16.9591653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9594973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9598943Z ^ 2025-05-07T19:54:16.9599154Z 2025-05-07T19:54:16.9599493Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:16.9600028Z 2025-05-07T19:54:16.9601475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9603619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9604700Z ^ 2025-05-07T19:54:16.9605096Z 2025-05-07T19:54:16.9606618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9609195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9610345Z ^ 2025-05-07T19:54:16.9610614Z 2025-05-07T19:54:16.9611050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:16.9611713Z 2025-05-07T19:54:16.9613395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9616044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9617406Z ^ 2025-05-07T19:54:16.9617741Z 2025-05-07T19:54:16.9619281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9621845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9623009Z ^ 2025-05-07T19:54:16.9623271Z 2025-05-07T19:54:16.9623716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:16.9624384Z 2025-05-07T19:54:16.9625975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9628586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:16.9629720Z ^ 2025-05-07T19:54:16.9630110Z 2025-05-07T19:54:16.9631708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9634280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9635415Z ^ 2025-05-07T19:54:16.9635663Z 2025-05-07T19:54:16.9636127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:16.9636796Z 2025-05-07T19:54:16.9638434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.9641278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:16.9642381Z ^ 2025-05-07T19:54:16.9642724Z 2025-05-07T19:54:17.4105512Z [163/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:17.4127037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4129665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4130838Z ^ 2025-05-07T19:54:17.4131121Z 2025-05-07T19:54:17.4131599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.4132286Z 2025-05-07T19:54:17.4133794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4136519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4138088Z ^ 2025-05-07T19:54:17.4138440Z 2025-05-07T19:54:17.4140045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4142596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4143872Z ^ 2025-05-07T19:54:17.4144135Z 2025-05-07T19:54:17.4144604Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.4145244Z 2025-05-07T19:54:17.4146843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4149433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4150577Z ^ 2025-05-07T19:54:17.4150961Z 2025-05-07T19:54:17.4152555Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4155103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4156233Z ^ 2025-05-07T19:54:17.4156503Z 2025-05-07T19:54:17.4156938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.4157575Z 2025-05-07T19:54:17.4159178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4161758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4162922Z ^ 2025-05-07T19:54:17.4163277Z 2025-05-07T19:54:17.4164876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4167402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4168531Z ^ 2025-05-07T19:54:17.4168785Z 2025-05-07T19:54:17.4169218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.4169902Z 2025-05-07T19:54:17.4171752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4174337Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4175482Z ^ 2025-05-07T19:54:17.4175854Z 2025-05-07T19:54:17.4177261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4179779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4181188Z ^ 2025-05-07T19:54:17.4181430Z 2025-05-07T19:54:17.4181784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.4182314Z 2025-05-07T19:54:17.4183786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.4186568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.4187750Z ^ 2025-05-07T19:54:17.4188107Z 2025-05-07T19:54:19.0884068Z [164/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:19.0904625Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.1234321Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 
2025-05-07T19:54:19.1253937Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.1580379Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.1599301Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.1996050Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.2015092Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.2341863Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.2359764Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.2685360Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.2703209Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.3104600Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.3123335Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.3451924Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.3471658Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.3872754Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.3892693Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.4213488Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.4233318Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:54:19.4628799Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.4648143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.5044000Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.5064491Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.5384465Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.5403477Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.5799182Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:19.5819487Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.4093827Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:20.4117694Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.4576541Z [179/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:20.4598573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4601145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4602257Z ^ 2025-05-07T19:54:20.4602529Z 
2025-05-07T19:54:20.4602965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4603933Z 2025-05-07T19:54:20.4605519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4608069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4609232Z ^ 2025-05-07T19:54:20.4609573Z 2025-05-07T19:54:20.4611392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4613939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4615062Z ^ 2025-05-07T19:54:20.4615321Z 2025-05-07T19:54:20.4615769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4616540Z 2025-05-07T19:54:20.4618087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4620697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4621812Z ^ 2025-05-07T19:54:20.4622172Z 2025-05-07T19:54:20.4623766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4626325Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4627473Z ^ 2025-05-07T19:54:20.4627731Z 2025-05-07T19:54:20.4628186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4628848Z 2025-05-07T19:54:20.4630424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4632962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4634143Z ^ 2025-05-07T19:54:20.4634505Z 2025-05-07T19:54:20.4636060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4638645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4639796Z ^ 2025-05-07T19:54:20.4640053Z 2025-05-07T19:54:20.4640480Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4641105Z 2025-05-07T19:54:20.4642732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4645311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4646473Z ^ 2025-05-07T19:54:20.4647073Z 2025-05-07T19:54:20.4648623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4651185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4652314Z ^ 2025-05-07T19:54:20.4652561Z 2025-05-07T19:54:20.4653195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4653858Z 2025-05-07T19:54:20.4655424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:20.4658141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:20.4659302Z ^ 2025-05-07T19:54:20.4659692Z 2025-05-07T19:54:21.0925116Z [180/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:21.0948734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.0951517Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.0953123Z ^ 2025-05-07T19:54:21.0953379Z 2025-05-07T19:54:21.0953826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:21.0954504Z 2025-05-07T19:54:21.0956196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.0959093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.0960277Z ^ 2025-05-07T19:54:21.0960663Z 2025-05-07T19:54:21.0961951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.0963967Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.0964861Z ^ 2025-05-07T19:54:21.0968336Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:21.0971778Z 2025-05-07T19:54:21.0973060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.0975050Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.0975945Z ^ 2025-05-07T19:54:21.0979453Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:21.0982619Z 2025-05-07T19:54:21.0983873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.0985857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.0986741Z ^ 2025-05-07T19:54:21.0990146Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:21.0993293Z 2025-05-07T19:54:21.0994524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.0996377Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.0997598Z ^ 2025-05-07T19:54:21.1001259Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, 
output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:21.1004535Z 2025-05-07T19:54:21.1005845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1007842Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1008751Z ^ 2025-05-07T19:54:21.1012229Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:21.1015475Z 2025-05-07T19:54:21.1016895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1018888Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1019809Z ^ 2025-05-07T19:54:21.1023305Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:21.1026540Z 2025-05-07T19:54:21.1027839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:54:21.1029830Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1030752Z ^ 2025-05-07T19:54:21.1034256Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:21.1037503Z 2025-05-07T19:54:21.1038784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1040798Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1041874Z ^ 2025-05-07T19:54:21.1045372Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:21.1048731Z 2025-05-07T19:54:21.1050015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1051692Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1052499Z ^ 2025-05-07T19:54:21.1055851Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:21.1059188Z 2025-05-07T19:54:21.1060446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1062399Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1063279Z ^ 2025-05-07T19:54:21.1066597Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:21.1069618Z 2025-05-07T19:54:21.1071078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1073109Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1073964Z ^ 2025-05-07T19:54:21.1077077Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:21.1079984Z 2025-05-07T19:54:21.1081283Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1083274Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1084173Z ^ 2025-05-07T19:54:21.1087846Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:21.1091065Z 2025-05-07T19:54:21.1092551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1094520Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1095418Z ^ 2025-05-07T19:54:21.1098837Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:21.1101908Z 2025-05-07T19:54:21.1103180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1105022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1105861Z ^ 
2025-05-07T19:54:21.1109174Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:21.1112240Z 2025-05-07T19:54:21.1113497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1115360Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1116227Z ^ 2025-05-07T19:54:21.1119563Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:21.1122375Z 2025-05-07T19:54:21.1123498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1125337Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1126201Z ^ 2025-05-07T19:54:21.1129507Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t 
*, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:21.1132925Z 2025-05-07T19:54:21.1134323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1136386Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1137268Z ^ 2025-05-07T19:54:21.1140580Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:21.1143668Z 2025-05-07T19:54:21.1144918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1146872Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1147720Z ^ 2025-05-07T19:54:21.1151035Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:21.1154024Z 2025-05-07T19:54:21.1155211Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1157113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1157987Z ^ 2025-05-07T19:54:21.1161433Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:21.1164600Z 2025-05-07T19:54:21.1166092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1168004Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1168865Z ^ 2025-05-07T19:54:21.1172485Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:21.1176038Z 2025-05-07T19:54:21.1177168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1179129Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1179912Z ^ 
2025-05-07T19:54:21.1183278Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:21.1185891Z 2025-05-07T19:54:21.1186803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1188342Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1189169Z ^ 2025-05-07T19:54:21.1192480Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:21.1195611Z 2025-05-07T19:54:21.1196758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1198399Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1199161Z ^ 2025-05-07T19:54:21.1201917Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:21.1204680Z 2025-05-07T19:54:21.1205844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1207649Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1208393Z ^ 2025-05-07T19:54:21.1211327Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:21.1214267Z 2025-05-07T19:54:21.1215824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1218687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1219833Z ^ 2025-05-07T19:54:21.1220074Z 2025-05-07T19:54:21.1220511Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:21.1221166Z 2025-05-07T19:54:21.1222765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1225177Z __attribute__((host)) __attribute__((device)) inline
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1226292Z ^ 2025-05-07T19:54:21.1226623Z 2025-05-07T19:54:21.1227825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1229611Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1230466Z ^ 2025-05-07T19:54:21.1233748Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:21.1236754Z 2025-05-07T19:54:21.1238010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1240024Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1240952Z ^ 2025-05-07T19:54:21.1244439Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:21.1247388Z 2025-05-07T19:54:21.1248596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion 
resulted in a change of sign 2025-05-07T19:54:21.1250566Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1251477Z ^ 2025-05-07T19:54:21.1255001Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:21.1258205Z 2025-05-07T19:54:21.1259413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1261354Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1262212Z ^ 2025-05-07T19:54:21.1265383Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:21.1268492Z 2025-05-07T19:54:21.1269707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1271795Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1272669Z ^ 2025-05-07T19:54:21.1275898Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, 
const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:21.1278976Z 2025-05-07T19:54:21.1280203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1282092Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1282914Z ^ 2025-05-07T19:54:21.1286189Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:21.1289300Z 2025-05-07T19:54:21.1290509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1292372Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1293240Z ^ 2025-05-07T19:54:21.1296655Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 
2025-05-07T19:54:21.1300047Z 2025-05-07T19:54:21.1301297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1303370Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1304223Z ^ 2025-05-07T19:54:21.1307519Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:21.1310375Z 2025-05-07T19:54:21.1311530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1313208Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1314017Z ^ 2025-05-07T19:54:21.1317147Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:21.1320043Z 2025-05-07T19:54:21.1321219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1323012Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1323837Z ^ 2025-05-07T19:54:21.1327023Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:21.1330019Z 2025-05-07T19:54:21.1331283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1333192Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1334063Z ^ 2025-05-07T19:54:21.1337519Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:21.1340921Z 2025-05-07T19:54:21.1342202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1344151Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1345104Z ^ 2025-05-07T19:54:21.1348523Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:21.1351600Z 2025-05-07T19:54:21.1352794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1354637Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1355466Z ^ 2025-05-07T19:54:21.1358581Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:21.1361248Z 2025-05-07T19:54:21.1362359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1364130Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1365011Z ^ 2025-05-07T19:54:21.1368235Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:21.1371579Z 
2025-05-07T19:54:21.1372759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1374641Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1375503Z ^ 2025-05-07T19:54:21.1378888Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:21.1385725Z 2025-05-07T19:54:21.1386941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1388824Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1389869Z ^ 2025-05-07T19:54:21.1393225Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:21.1396292Z 2025-05-07T19:54:21.1397535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1399380Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:21.1400256Z ^ 2025-05-07T19:54:21.1403587Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:21.1406730Z 2025-05-07T19:54:21.1407942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1409843Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1410698Z ^ 2025-05-07T19:54:21.1414052Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:21.1417245Z 2025-05-07T19:54:21.1418475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1420343Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1421190Z ^ 2025-05-07T19:54:21.1424549Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float 
*, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:21.1427883Z 2025-05-07T19:54:21.1429089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1430972Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1431821Z ^ 2025-05-07T19:54:21.1435328Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:21.1438411Z 2025-05-07T19:54:21.1439619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1441515Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1442358Z ^ 2025-05-07T19:54:21.1445663Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:21.1448778Z 2025-05-07T19:54:21.1449997Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1451856Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1452721Z ^ 2025-05-07T19:54:21.1456087Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:21.1459367Z 2025-05-07T19:54:21.1460584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1462497Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1463357Z ^ 2025-05-07T19:54:21.1466748Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:21.1470078Z 2025-05-07T19:54:21.1471532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1473363Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:21.1474207Z ^ 2025-05-07T19:54:21.1477863Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:21.1481021Z 2025-05-07T19:54:21.1482589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1485157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1486246Z ^ 2025-05-07T19:54:21.1486505Z 2025-05-07T19:54:21.1486936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:21.1487566Z 2025-05-07T19:54:21.1489186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1491729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1492897Z ^ 2025-05-07T19:54:21.1493256Z 2025-05-07T19:54:21.1494430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1496406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1497287Z ^ 2025-05-07T19:54:21.1500527Z 
detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:21.1503557Z 2025-05-07T19:54:21.1504774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1506631Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1507476Z ^ 2025-05-07T19:54:21.1510691Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:21.1514036Z 2025-05-07T19:54:21.1515249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1517097Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1517945Z ^ 2025-05-07T19:54:21.1521375Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:21.1524464Z 2025-05-07T19:54:21.1525665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1527527Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1528403Z ^ 2025-05-07T19:54:21.1531701Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:21.1534788Z 2025-05-07T19:54:21.1536003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1538023Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1538872Z ^ 2025-05-07T19:54:21.1542185Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:21.1545278Z 2025-05-07T19:54:21.1546500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:54:21.1548371Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1549253Z ^ 2025-05-07T19:54:21.1552567Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:21.1555846Z 2025-05-07T19:54:21.1557085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1558983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1559837Z ^ 2025-05-07T19:54:21.1563368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:21.1566484Z 2025-05-07T19:54:21.1567701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1569577Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1570708Z ^ 2025-05-07T19:54:21.1573995Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:21.1577280Z 2025-05-07T19:54:21.1578508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1580399Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1581262Z ^ 2025-05-07T19:54:21.1584577Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:21.1587690Z 2025-05-07T19:54:21.1588883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1590764Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1591622Z ^ 2025-05-07T19:54:21.1594875Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 
2025-05-07T19:54:21.1597936Z 2025-05-07T19:54:21.1599521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1601385Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1602215Z ^ 2025-05-07T19:54:21.1605772Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:21.1608810Z 2025-05-07T19:54:21.1610049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1611866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1612731Z ^ 2025-05-07T19:54:21.1616042Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:21.1619237Z 2025-05-07T19:54:21.1620422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1622301Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1623160Z ^ 2025-05-07T19:54:21.1626455Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:21.1629511Z 2025-05-07T19:54:21.1630742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1632622Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1633463Z ^ 2025-05-07T19:54:21.1636776Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:21.1639814Z 2025-05-07T19:54:21.1641018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1643056Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1643864Z ^ 2025-05-07T19:54:21.1646517Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:21.1648726Z 2025-05-07T19:54:21.1649870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1651759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1652550Z ^ 2025-05-07T19:54:21.1655631Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:21.1658196Z 2025-05-07T19:54:21.1659289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1661260Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1662167Z ^ 2025-05-07T19:54:21.1665456Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:21.1668539Z 2025-05-07T19:54:21.1669549Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1671539Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1672279Z ^ 2025-05-07T19:54:21.1691306Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:21.1694643Z 2025-05-07T19:54:21.1695936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1698461Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1699376Z ^ 2025-05-07T19:54:21.1703119Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:21.1706430Z 2025-05-07T19:54:21.1707700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1709487Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:21.1710287Z ^ 2025-05-07T19:54:21.1713686Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:21.1716856Z 2025-05-07T19:54:21.1718056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1719967Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1720849Z ^ 2025-05-07T19:54:21.1724223Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:21.1727418Z 2025-05-07T19:54:21.1728748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1730755Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1731661Z ^ 2025-05-07T19:54:21.1735224Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const 
float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:21.1738669Z 2025-05-07T19:54:21.1739957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1742177Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1743080Z ^ 2025-05-07T19:54:21.1746725Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:21.1749633Z 2025-05-07T19:54:21.1750882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1752869Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1753769Z ^ 2025-05-07T19:54:21.1757234Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:21.1760403Z 2025-05-07T19:54:21.1762029Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1764674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1765816Z ^ 2025-05-07T19:54:21.1766065Z 2025-05-07T19:54:21.1766507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:21.1767162Z 2025-05-07T19:54:21.1768627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.1771256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.1772294Z ^ 2025-05-07T19:54:21.1772621Z 2025-05-07T19:54:21.1773818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1775597Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1776515Z ^ 2025-05-07T19:54:21.1779511Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:21.1782602Z 2025-05-07T19:54:21.1783839Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1786095Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1787006Z ^ 2025-05-07T19:54:21.1790634Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:21.1793864Z 2025-05-07T19:54:21.1795138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1797138Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1798043Z ^ 2025-05-07T19:54:21.1801541Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:21.1804764Z 2025-05-07T19:54:21.1806039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1808014Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1808903Z ^ 
2025-05-07T19:54:21.1812389Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:21.1815630Z 2025-05-07T19:54:21.1817031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1819018Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1819905Z ^ 2025-05-07T19:54:21.1823397Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:21.1826603Z 2025-05-07T19:54:21.1827858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1829931Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1830812Z ^ 2025-05-07T19:54:21.1834273Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:21.1837522Z 2025-05-07T19:54:21.1838790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1840757Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1841659Z ^ 2025-05-07T19:54:21.1845127Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:21.1848368Z 2025-05-07T19:54:21.1849647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1851628Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1852515Z ^ 2025-05-07T19:54:21.1856014Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:21.1859365Z 2025-05-07T19:54:21.1860652Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1862604Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1863493Z ^ 2025-05-07T19:54:21.1866988Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:21.1870399Z 2025-05-07T19:54:21.1871685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1873921Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1874827Z ^ 2025-05-07T19:54:21.1878439Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:21.1881506Z 2025-05-07T19:54:21.1882751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1884667Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1885566Z ^ 
2025-05-07T19:54:21.1889006Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:21.1892160Z 2025-05-07T19:54:21.1893405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1895361Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1896350Z ^ 2025-05-07T19:54:21.1899775Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:21.1903010Z 2025-05-07T19:54:21.1904276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1906235Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1907130Z ^ 2025-05-07T19:54:21.1910562Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:21.1913773Z 2025-05-07T19:54:21.1915026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1916966Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1918025Z ^ 2025-05-07T19:54:21.1923981Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:21.1927145Z 2025-05-07T19:54:21.1928409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1930285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1931074Z ^ 2025-05-07T19:54:21.1933995Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:21.1937386Z 2025-05-07T19:54:21.1938694Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1940704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1941631Z ^ 2025-05-07T19:54:21.1945225Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:21.1948538Z 2025-05-07T19:54:21.1949814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1951813Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1952733Z ^ 2025-05-07T19:54:21.1956258Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:21.1959588Z 2025-05-07T19:54:21.1960879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1962857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1963941Z ^ 
2025-05-07T19:54:21.1967478Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:21.1971246Z 2025-05-07T19:54:21.1972537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1974540Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1975425Z ^ 2025-05-07T19:54:21.1979081Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:21.1982409Z 2025-05-07T19:54:21.1983687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1985675Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1986567Z ^ 2025-05-07T19:54:21.1990111Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:21.1993436Z 2025-05-07T19:54:21.1994729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.1996716Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.1997612Z ^ 2025-05-07T19:54:21.2001121Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:21.2004425Z 2025-05-07T19:54:21.2005729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2007728Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2008623Z ^ 2025-05-07T19:54:21.2012412Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:21.2015725Z 2025-05-07T19:54:21.2017226Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2019206Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2020120Z ^ 2025-05-07T19:54:21.2023627Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:21.2026891Z 2025-05-07T19:54:21.2028187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2030152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2031055Z ^ 2025-05-07T19:54:21.2034613Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:21.2037895Z 2025-05-07T19:54:21.2039586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.2042265Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.2043436Z ^ 2025-05-07T19:54:21.2043695Z 2025-05-07T19:54:21.2044143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:21.2044827Z 2025-05-07T19:54:21.2046496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.2049184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:21.2050375Z ^ 2025-05-07T19:54:21.2050752Z 2025-05-07T19:54:21.2052029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2053997Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2055038Z ^ 2025-05-07T19:54:21.2058708Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:21.2061928Z 2025-05-07T19:54:21.2063212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2065175Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2066085Z ^ 2025-05-07T19:54:21.2069430Z detected during 
instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:21.2072766Z 2025-05-07T19:54:21.2074051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2076006Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2076908Z ^ 2025-05-07T19:54:21.2080317Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:21.2083478Z 2025-05-07T19:54:21.2084751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2086686Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2087575Z ^ 2025-05-07T19:54:21.2091040Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:21.2094230Z 2025-05-07T19:54:21.2095478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2097488Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2098369Z ^ 2025-05-07T19:54:21.2102015Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:21.2105190Z 2025-05-07T19:54:21.2106582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2108479Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2109358Z ^ 2025-05-07T19:54:21.2112722Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:21.2115831Z 2025-05-07T19:54:21.2117064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion 
resulted in a change of sign 2025-05-07T19:54:21.2118872Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2119609Z ^ 2025-05-07T19:54:21.2122801Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:21.2126062Z 2025-05-07T19:54:21.2127346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2129313Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2130201Z ^ 2025-05-07T19:54:21.2133725Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:21.2137115Z 2025-05-07T19:54:21.2138400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2140388Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2141274Z ^ 2025-05-07T19:54:21.2144808Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, 
const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:21.2148253Z 2025-05-07T19:54:21.2149656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2151644Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2152550Z ^ 2025-05-07T19:54:21.2156066Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:21.2159349Z 2025-05-07T19:54:21.2160658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2162645Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2163552Z ^ 2025-05-07T19:54:21.2167073Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 
2025-05-07T19:54:21.2170557Z 2025-05-07T19:54:21.2171865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2173840Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2174752Z ^ 2025-05-07T19:54:21.2178430Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:21.2181723Z 2025-05-07T19:54:21.2183035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2185011Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2185919Z ^ 2025-05-07T19:54:21.2189439Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:21.2192987Z 2025-05-07T19:54:21.2194437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2196444Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2197349Z ^ 2025-05-07T19:54:21.2200853Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:21.2204138Z 2025-05-07T19:54:21.2205433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2207442Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2208352Z ^ 2025-05-07T19:54:21.2211336Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:21.2214154Z 2025-05-07T19:54:21.2215395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2217438Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2218318Z ^ 2025-05-07T19:54:21.2221798Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:21.2225050Z 2025-05-07T19:54:21.2226324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2228285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2229179Z ^ 2025-05-07T19:54:21.2232669Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:21.2235902Z 2025-05-07T19:54:21.2237193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2239293Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2240141Z ^ 2025-05-07T19:54:21.2243269Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:21.2246223Z 
2025-05-07T19:54:21.2247303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2249094Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2249962Z ^ 2025-05-07T19:54:21.2253348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:21.2256397Z 2025-05-07T19:54:21.2257550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2259412Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2260231Z ^ 2025-05-07T19:54:21.2263530Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:21.2266567Z 2025-05-07T19:54:21.2267549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2269326Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:54:21.2270371Z ^ 2025-05-07T19:54:21.2273791Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:21.2277404Z 2025-05-07T19:54:21.2278667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2280782Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2281685Z ^ 2025-05-07T19:54:21.2284989Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:21.2287976Z 2025-05-07T19:54:21.2289253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2291232Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2292137Z ^ 2025-05-07T19:54:21.2295632Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t 
*, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:21.2299023Z 2025-05-07T19:54:21.2300296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:21.2302269Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:21.2303157Z ^ 2025-05-07T19:54:21.2306702Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:21.2310003Z 2025-05-07T19:54:22.6502567Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:22.6521769Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:23.8612896Z [182/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:23.8632654Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:26.4456018Z [183/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:26.4476521Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.3371793Z [184/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:27.3389012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3390790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3391577Z ^ 2025-05-07T19:54:27.3391756Z 2025-05-07T19:54:27.3392057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.3392562Z 2025-05-07T19:54:27.3393771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3395733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3396623Z ^ 2025-05-07T19:54:27.3396916Z 2025-05-07T19:54:27.3398127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3400112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3400967Z ^ 2025-05-07T19:54:27.3401158Z 2025-05-07T19:54:27.3401474Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.3401915Z 2025-05-07T19:54:27.3403244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3405460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3406435Z ^ 2025-05-07T19:54:27.3406722Z 2025-05-07T19:54:27.3408204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3410194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3411030Z ^ 2025-05-07T19:54:27.3411232Z 2025-05-07T19:54:27.3411571Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:54:27.3412101Z 2025-05-07T19:54:27.3413274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3415571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3416561Z ^ 2025-05-07T19:54:27.3416823Z 2025-05-07T19:54:27.3418191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3420163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3421355Z ^ 2025-05-07T19:54:27.3421615Z 2025-05-07T19:54:27.3422093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.3422809Z 2025-05-07T19:54:27.3424516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3427357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3428649Z ^ 2025-05-07T19:54:27.3429041Z 2025-05-07T19:54:27.3430762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3433588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3434864Z ^ 2025-05-07T19:54:27.3435127Z 2025-05-07T19:54:27.3435595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.3436284Z 2025-05-07T19:54:27.3437962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.3439974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.3440847Z ^ 2025-05-07T19:54:27.3441184Z 2025-05-07T19:54:27.7282155Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:27.7300431Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.6325924Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.6345192Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.2703328Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.2722464Z clang-16: warning: argument unused 
during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.3011012Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.3030951Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.3330775Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.3350599Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.3648820Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.3669034Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.3965965Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.3986549Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.4281170Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.4298577Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.4600865Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.4621473Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:36.1277413Z [194/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:36.1300552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1303016Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1304038Z ^ 2025-05-07T19:54:36.1304243Z 2025-05-07T19:54:36.1304623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1305207Z 2025-05-07T19:54:36.1306727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1309266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1310841Z ^ 2025-05-07T19:54:36.1311211Z 2025-05-07T19:54:36.1312847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1315325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1316571Z ^ 2025-05-07T19:54:36.1316833Z 2025-05-07T19:54:36.1317261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1317905Z 2025-05-07T19:54:36.1319318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1321975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1323152Z ^ 2025-05-07T19:54:36.1323510Z 2025-05-07T19:54:36.1325152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1327788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1328949Z ^ 2025-05-07T19:54:36.1329193Z 2025-05-07T19:54:36.1329622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1330284Z 2025-05-07T19:54:36.1331924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1334620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1335778Z ^ 2025-05-07T19:54:36.1336148Z 2025-05-07T19:54:36.1337930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1340489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1341465Z ^ 2025-05-07T19:54:36.1341710Z 2025-05-07T19:54:36.1342083Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:54:36.1342722Z 2025-05-07T19:54:36.1344365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1347028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1348212Z ^ 2025-05-07T19:54:36.1348569Z 2025-05-07T19:54:36.1350208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1352857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1354246Z ^ 2025-05-07T19:54:36.1354495Z 2025-05-07T19:54:36.1354906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1355562Z 2025-05-07T19:54:36.1357215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1360057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1361222Z ^ 2025-05-07T19:54:36.1361588Z 2025-05-07T19:54:36.2075838Z [195/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:36.2094121Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.5841098Z [196/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:37.5861277Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.6819324Z [197/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:37.6842165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6844881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6846046Z ^ 2025-05-07T19:54:37.6846316Z 2025-05-07T19:54:37.6847133Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.6847813Z 2025-05-07T19:54:37.6849525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6852256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6853270Z ^ 2025-05-07T19:54:37.6853616Z 2025-05-07T19:54:37.6855237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6858029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6859201Z ^ 2025-05-07T19:54:37.6859445Z 2025-05-07T19:54:37.6859893Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.6860550Z 2025-05-07T19:54:37.6862169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6864823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6865973Z ^ 2025-05-07T19:54:37.6866362Z 2025-05-07T19:54:37.6868047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6870983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6872159Z ^ 2025-05-07T19:54:37.6872429Z 2025-05-07T19:54:37.6872885Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:54:37.6873555Z 2025-05-07T19:54:37.6875250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6877965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6879164Z ^ 2025-05-07T19:54:37.6879532Z 2025-05-07T19:54:37.6881196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6883887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6885079Z ^ 2025-05-07T19:54:37.6885337Z 2025-05-07T19:54:37.6885790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.6886798Z 2025-05-07T19:54:37.6888350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6890901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6892047Z ^ 2025-05-07T19:54:37.6892561Z 2025-05-07T19:54:37.6894196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6896991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6898133Z ^ 2025-05-07T19:54:37.6898380Z 2025-05-07T19:54:37.6898831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.6899492Z 2025-05-07T19:54:37.6900845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.6902936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.6903904Z ^ 2025-05-07T19:54:37.6904231Z 2025-05-07T19:54:38.6740206Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:38.6761862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6764761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6765887Z ^ 2025-05-07T19:54:38.6766121Z 2025-05-07T19:54:38.6766551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.6767195Z 2025-05-07T19:54:38.6768841Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6771715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6772855Z ^ 2025-05-07T19:54:38.6773208Z 2025-05-07T19:54:38.6774823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6777513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6778578Z ^ 2025-05-07T19:54:38.6778825Z 2025-05-07T19:54:38.6779246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.6779823Z 2025-05-07T19:54:38.6781093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6783100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6783969Z ^ 2025-05-07T19:54:38.6784250Z 2025-05-07T19:54:38.6785521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6787563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6788480Z ^ 
2025-05-07T19:54:38.6788678Z 2025-05-07T19:54:38.6789038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.6789584Z 2025-05-07T19:54:38.6790899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6792923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6793821Z ^ 2025-05-07T19:54:38.6794147Z 2025-05-07T19:54:38.6795579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6798295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6799293Z ^ 2025-05-07T19:54:38.6799524Z 2025-05-07T19:54:38.6799902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.6800484Z 2025-05-07T19:54:38.6802120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6804367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6805382Z ^ 2025-05-07T19:54:38.6805688Z 2025-05-07T19:54:38.6807013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:38.6809229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6810139Z ^ 2025-05-07T19:54:38.6810342Z 2025-05-07T19:54:38.6810701Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.6811233Z 2025-05-07T19:54:38.6812674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.6815167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.6816253Z ^ 2025-05-07T19:54:38.6816809Z 2025-05-07T19:54:38.7533995Z [199/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:38.7557205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7559721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7560846Z ^ 2025-05-07T19:54:38.7561095Z 
2025-05-07T19:54:38.7561431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.7562020Z 2025-05-07T19:54:38.7563515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7566100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7567246Z ^ 2025-05-07T19:54:38.7567607Z 2025-05-07T19:54:38.7569327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7572209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7573357Z ^ 2025-05-07T19:54:38.7573607Z 2025-05-07T19:54:38.7574044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.7574681Z 2025-05-07T19:54:38.7576409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7579018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7580155Z ^ 2025-05-07T19:54:38.7580527Z 2025-05-07T19:54:38.7582109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7584704Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7585829Z ^ 2025-05-07T19:54:38.7586091Z 2025-05-07T19:54:38.7586537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.7587166Z 2025-05-07T19:54:38.7588794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7591408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7592563Z ^ 2025-05-07T19:54:38.7593352Z 2025-05-07T19:54:38.7594941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7597498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7598603Z ^ 2025-05-07T19:54:38.7598851Z 2025-05-07T19:54:38.7599438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.7600085Z 2025-05-07T19:54:38.7601671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7604255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7605410Z ^ 2025-05-07T19:54:38.7605783Z 2025-05-07T19:54:38.7607346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7609915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7611027Z ^ 2025-05-07T19:54:38.7611284Z 2025-05-07T19:54:38.7611694Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.7612310Z 2025-05-07T19:54:38.7613952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.7616642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.7617676Z ^ 2025-05-07T19:54:38.7618024Z 2025-05-07T19:54:39.6913468Z [200/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:39.6935624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6938499Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6939659Z ^ 2025-05-07T19:54:39.6939907Z 2025-05-07T19:54:39.6940359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.6941023Z 2025-05-07T19:54:39.6942693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6945355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6946531Z ^ 2025-05-07T19:54:39.6946904Z 2025-05-07T19:54:39.6948530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6951164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6952215Z ^ 2025-05-07T19:54:39.6952441Z 2025-05-07T19:54:39.6952893Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.6953553Z 2025-05-07T19:54:39.6955180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6957620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6958853Z ^ 2025-05-07T19:54:39.6959206Z 2025-05-07T19:54:39.6960740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6963065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6964192Z ^ 2025-05-07T19:54:39.6964425Z 2025-05-07T19:54:39.6964831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.6965409Z 2025-05-07T19:54:39.6966955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6969880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6971289Z ^ 2025-05-07T19:54:39.6971630Z 2025-05-07T19:54:39.6972990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6975970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6977263Z ^ 2025-05-07T19:54:39.6977493Z 2025-05-07T19:54:39.6977900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.6978578Z 2025-05-07T19:54:39.6980162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6982671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:39.6983663Z ^ 2025-05-07T19:54:39.6984021Z 2025-05-07T19:54:39.6985539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6988032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6989093Z ^ 2025-05-07T19:54:39.6989338Z 2025-05-07T19:54:39.6989773Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.6990407Z 2025-05-07T19:54:39.6991946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6994500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.6995620Z ^ 2025-05-07T19:54:39.6995950Z 2025-05-07T19:54:40.5400754Z [201/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.5420300Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.7977104Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.7996713Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:40.9215468Z [203/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:40.9239876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9242686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9243896Z ^ 2025-05-07T19:54:40.9244150Z 2025-05-07T19:54:40.9244619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.9245306Z 2025-05-07T19:54:40.9246986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9249691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9250902Z ^ 2025-05-07T19:54:40.9251278Z 2025-05-07T19:54:40.9252926Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9254961Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9255527Z ^ 2025-05-07T19:54:40.9255838Z 2025-05-07T19:54:40.9257624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9259665Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9260219Z ^ 2025-05-07T19:54:40.9263907Z 2025-05-07T19:54:40.9265533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9267582Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9268152Z ^ 2025-05-07T19:54:40.9268454Z 2025-05-07T19:54:40.9270650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9273382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9274572Z ^ 2025-05-07T19:54:40.9274823Z 2025-05-07T19:54:40.9275281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.9275978Z 2025-05-07T19:54:40.9277467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:40.9279703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9280694Z ^ 2025-05-07T19:54:40.9281014Z 2025-05-07T19:54:40.9282454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9284192Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9284676Z ^ 2025-05-07T19:54:40.9284939Z 2025-05-07T19:54:40.9286331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9288090Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9288609Z ^ 2025-05-07T19:54:40.9288880Z 2025-05-07T19:54:40.9290341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9292312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9292860Z ^ 2025-05-07T19:54:40.9293153Z 2025-05-07T19:54:40.9294763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9297539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9298685Z ^ 2025-05-07T19:54:40.9298924Z 2025-05-07T19:54:40.9299356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.9299977Z 
2025-05-07T19:54:40.9301593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9304242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9305409Z ^ 2025-05-07T19:54:40.9305780Z 2025-05-07T19:54:40.9307653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9309566Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9310107Z ^ 2025-05-07T19:54:40.9310399Z 2025-05-07T19:54:40.9312127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9314082Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9314631Z ^ 2025-05-07T19:54:40.9314919Z 2025-05-07T19:54:40.9316497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9318403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9318954Z ^ 2025-05-07T19:54:40.9319241Z 2025-05-07T19:54:40.9320783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9323381Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9324518Z ^ 2025-05-07T19:54:40.9324766Z 2025-05-07T19:54:40.9325191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.9325857Z 2025-05-07T19:54:40.9327454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9330022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9331169Z ^ 2025-05-07T19:54:40.9331534Z 2025-05-07T19:54:40.9333109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9335015Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9335557Z ^ 2025-05-07T19:54:40.9335843Z 2025-05-07T19:54:40.9337538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9339434Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9339968Z ^ 2025-05-07T19:54:40.9340241Z 2025-05-07T19:54:40.9341766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9343681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9344228Z ^ 2025-05-07T19:54:40.9344531Z 2025-05-07T19:54:40.9346128Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9348691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9350038Z ^ 2025-05-07T19:54:40.9350276Z 2025-05-07T19:54:40.9350706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.9351382Z 2025-05-07T19:54:40.9353056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.9355823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.9356986Z ^ 2025-05-07T19:54:40.9357344Z 2025-05-07T19:54:40.9358951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9360956Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9361530Z ^ 2025-05-07T19:54:40.9361827Z 2025-05-07T19:54:40.9363438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9365450Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9366016Z ^ 2025-05-07T19:54:40.9366327Z 2025-05-07T19:54:40.9367930Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.9369973Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.9370790Z ^ 2025-05-07T19:54:40.9371085Z 2025-05-07T19:54:41.0699459Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.0718774Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:41.3925245Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:41.3943460Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.9685858Z [206/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.9704978Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.2285583Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.2304283Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.9495969Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.9515975Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:47.3018732Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:47.3038013Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:49.4704854Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:49.4723370Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:49.5034007Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:49.5049872Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:51.6192553Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:54:51.6209928Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.5063775Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:54:52.5082876Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.6547307Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:54:52.6567127Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.9520093Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:54:52.9536841Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.4147934Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:54:53.4167335Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.5365892Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:54:53.5385892Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.9046498Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:54:53.9061670Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.2695213Z [219/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:54.2717531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2720178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2721624Z ^ 2025-05-07T19:54:54.2721865Z 2025-05-07T19:54:54.2722298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:54.2722972Z 
2025-05-07T19:54:54.2724649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2727424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2728632Z ^ 2025-05-07T19:54:54.2728997Z 2025-05-07T19:54:54.2730581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2733303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2734467Z ^ 2025-05-07T19:54:54.2734725Z 2025-05-07T19:54:54.2735170Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:54.2735833Z 2025-05-07T19:54:54.2737679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2740371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2741516Z ^ 2025-05-07T19:54:54.2741901Z 2025-05-07T19:54:54.2743587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2746239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:54:54.2747335Z ^ 2025-05-07T19:54:54.2747599Z 2025-05-07T19:54:54.2748070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:54.2748743Z 2025-05-07T19:54:54.2750395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2753134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2754258Z ^ 2025-05-07T19:54:54.2754654Z 2025-05-07T19:54:54.2756229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2758916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2760035Z ^ 2025-05-07T19:54:54.2760324Z 2025-05-07T19:54:54.2760778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:54.2761415Z 2025-05-07T19:54:54.2763069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2765724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2767211Z ^ 2025-05-07T19:54:54.2767575Z 2025-05-07T19:54:54.2769251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:54:54.2772407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2773604Z ^ 2025-05-07T19:54:54.2773866Z 2025-05-07T19:54:54.2774323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:54.2775022Z 2025-05-07T19:54:54.2776703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:54.2779293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:54.2780439Z ^ 2025-05-07T19:54:54.2780832Z 2025-05-07T19:54:54.3079031Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:54:54.3095782Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.9951834Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:54:55.9969129Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.1863830Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:54:56.1880631Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.3833862Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:54:56.3851292Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2418388Z [224/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:54:57.2437476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.6379547Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:54:57.6398737Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.1130794Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.1152069Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.5168907Z [227/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:54:58.5186097Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.8264044Z [228/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:54:59.7822650Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:54:59.7839902Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.0457570Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:02.0478213Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.2677578Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:02.2695226Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:55:02.8566726Z [232/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:02.8591376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8594117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8595313Z ^ 2025-05-07T19:55:02.8595571Z 2025-05-07T19:55:02.8595947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:02.8596530Z 2025-05-07T19:55:02.8598071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8600718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8601910Z ^ 2025-05-07T19:55:02.8602264Z 2025-05-07T19:55:02.8603840Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8605780Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8606287Z ^ 2025-05-07T19:55:02.8606573Z 2025-05-07T19:55:02.8608112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8610043Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8610598Z ^ 2025-05-07T19:55:02.8610892Z 2025-05-07T19:55:02.8612444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8614719Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8615271Z ^ 2025-05-07T19:55:02.8615550Z 2025-05-07T19:55:02.8617113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8619498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8620818Z ^ 2025-05-07T19:55:02.8621049Z 2025-05-07T19:55:02.8621437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:02.8621936Z 2025-05-07T19:55:02.8623428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:02.8626031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8627186Z ^ 2025-05-07T19:55:02.8627547Z 2025-05-07T19:55:02.8629116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8631108Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8631662Z ^ 2025-05-07T19:55:02.8631950Z 2025-05-07T19:55:02.8633511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8635416Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8635954Z ^ 2025-05-07T19:55:02.8636227Z 2025-05-07T19:55:02.8637782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8639745Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8640284Z ^ 2025-05-07T19:55:02.8640566Z 2025-05-07T19:55:02.8642196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8644656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8645832Z ^ 2025-05-07T19:55:02.8646080Z 2025-05-07T19:55:02.8646492Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:02.8647162Z 
2025-05-07T19:55:02.8648756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8651392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8652561Z ^ 2025-05-07T19:55:02.8652928Z 2025-05-07T19:55:02.8654523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8656730Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8657377Z ^ 2025-05-07T19:55:02.8657678Z 2025-05-07T19:55:02.8659270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8661218Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8661741Z ^ 2025-05-07T19:55:02.8662027Z 2025-05-07T19:55:02.8663696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8665651Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8666210Z ^ 2025-05-07T19:55:02.8666501Z 2025-05-07T19:55:02.8667975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8670465Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8671530Z ^ 2025-05-07T19:55:02.8671783Z 2025-05-07T19:55:02.8672208Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:02.8672877Z 2025-05-07T19:55:02.8674501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8677023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8678139Z ^ 2025-05-07T19:55:02.8678407Z 2025-05-07T19:55:02.8680009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8682011Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8682528Z ^ 2025-05-07T19:55:02.8682823Z 2025-05-07T19:55:02.8684206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8686097Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8686616Z ^ 2025-05-07T19:55:02.8686912Z 2025-05-07T19:55:02.8688482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8690429Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8690973Z ^ 2025-05-07T19:55:02.8691282Z 2025-05-07T19:55:02.8692889Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8695323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8696348Z ^ 2025-05-07T19:55:02.8696728Z 2025-05-07T19:55:02.8697063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:02.8697804Z 2025-05-07T19:55:02.8699259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8701627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:02.8702578Z ^ 2025-05-07T19:55:02.8702904Z 2025-05-07T19:55:02.8704606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8706362Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8706857Z ^ 2025-05-07T19:55:02.8707125Z 2025-05-07T19:55:02.8708537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8710308Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8710853Z ^ 2025-05-07T19:55:02.8711150Z 2025-05-07T19:55:02.8712601Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8714570Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:02.8715109Z ^ 2025-05-07T19:55:02.8715394Z 2025-05-07T19:55:05.1336936Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:05.1356311Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.7784879Z [234/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:05.7807722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7810337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7811521Z ^ 2025-05-07T19:55:05.7811776Z 2025-05-07T19:55:05.7812205Z Remark: The 
warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:05.7812857Z 2025-05-07T19:55:05.7814469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7816832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7817845Z ^ 2025-05-07T19:55:05.7818194Z 2025-05-07T19:55:05.7819614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7821945Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:05.7822646Z ^ 2025-05-07T19:55:05.7822910Z 2025-05-07T19:55:05.7824334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7826284Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7826824Z ^ 2025-05-07T19:55:05.7827083Z 2025-05-07T19:55:05.7828565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7830458Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7830997Z ^ 2025-05-07T19:55:05.7831251Z 2025-05-07T19:55:05.7832728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:05.7834658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7835191Z ^ 2025-05-07T19:55:05.7835465Z 2025-05-07T19:55:05.7837073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7839708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7840823Z ^ 2025-05-07T19:55:05.7841084Z 2025-05-07T19:55:05.7841495Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:05.7842135Z 2025-05-07T19:55:05.7843776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7846347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7847512Z ^ 2025-05-07T19:55:05.7847865Z 2025-05-07T19:55:05.7849408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7851449Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:05.7852180Z ^ 2025-05-07T19:55:05.7852465Z 2025-05-07T19:55:05.7853877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7855795Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7856344Z ^ 
2025-05-07T19:55:05.7856746Z 2025-05-07T19:55:05.7858178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7860098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7860612Z ^ 2025-05-07T19:55:05.7860866Z 2025-05-07T19:55:05.7862362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7864435Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7864965Z ^ 2025-05-07T19:55:05.7865251Z 2025-05-07T19:55:05.7866859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7869535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7870891Z ^ 2025-05-07T19:55:05.7871153Z 2025-05-07T19:55:05.7871572Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:05.7872213Z 2025-05-07T19:55:05.7873836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7876463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7877532Z ^ 2025-05-07T19:55:05.7877883Z 2025-05-07T19:55:05.7879422Z
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7881315Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:05.7881907Z ^ 2025-05-07T19:55:05.7882150Z 2025-05-07T19:55:05.7883488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7885297Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7885810Z ^ 2025-05-07T19:55:05.7886085Z 2025-05-07T19:55:05.7887493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7889370Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7889882Z ^ 2025-05-07T19:55:05.7890155Z 2025-05-07T19:55:05.7891577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7893428Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7893948Z ^ 2025-05-07T19:55:05.7894219Z 2025-05-07T19:55:05.7895717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7898385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7899518Z ^ 
2025-05-07T19:55:05.7899772Z 2025-05-07T19:55:05.7900221Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:05.7900868Z 2025-05-07T19:55:05.7902492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7905389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7906539Z ^ 2025-05-07T19:55:05.7906888Z 2025-05-07T19:55:05.7908379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7910630Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:05.7911349Z ^ 2025-05-07T19:55:05.7911628Z 2025-05-07T19:55:05.7913129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7914978Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7915516Z ^ 2025-05-07T19:55:05.7915804Z 2025-05-07T19:55:05.7917324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7919236Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7919764Z ^ 2025-05-07T19:55:05.7920032Z 2025-05-07T19:55:05.7921587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7923470Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7924006Z ^ 2025-05-07T19:55:05.7924272Z 2025-05-07T19:55:05.7925832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7928362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7929514Z ^ 2025-05-07T19:55:05.7929760Z 2025-05-07T19:55:05.7930188Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:05.7930851Z 2025-05-07T19:55:05.7932458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.7935026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.7936117Z ^ 2025-05-07T19:55:05.7936612Z 2025-05-07T19:55:05.7938107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7940196Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:05.7940912Z ^ 2025-05-07T19:55:05.7941202Z 2025-05-07T19:55:05.7942742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7944646Z if (output_d >=
0 && output_d < D) { 2025-05-07T19:55:05.7945205Z ^ 2025-05-07T19:55:05.7945488Z 2025-05-07T19:55:05.7946815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7948653Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7949155Z ^ 2025-05-07T19:55:05.7949417Z 2025-05-07T19:55:05.7950721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.7952701Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.7953212Z ^ 2025-05-07T19:55:05.7953482Z 2025-05-07T19:55:06.3087419Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:06.3106286Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.5807170Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:06.5825979Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.7541958Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:07.7564338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7567407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7568512Z ^ 2025-05-07T19:55:07.7568738Z 2025-05-07T19:55:07.7569158Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:07.7569814Z 2025-05-07T19:55:07.7572087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7574576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7575575Z ^ 2025-05-07T19:55:07.7575925Z 2025-05-07T19:55:07.7577421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7579469Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:07.7580172Z ^ 2025-05-07T19:55:07.7580444Z 2025-05-07T19:55:07.7581932Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7583703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7584246Z ^ 2025-05-07T19:55:07.7584528Z 2025-05-07T19:55:07.7586055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7587977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7588521Z ^ 2025-05-07T19:55:07.7588788Z 2025-05-07T19:55:07.7590026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7591460Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7591868Z ^ 2025-05-07T19:55:07.7592060Z 2025-05-07T19:55:07.7593312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7595434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7596500Z ^ 2025-05-07T19:55:07.7596727Z 2025-05-07T19:55:07.7597132Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:07.7597756Z 2025-05-07T19:55:07.7599216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7601522Z
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7602620Z ^ 2025-05-07T19:55:07.7602987Z 2025-05-07T19:55:07.7604474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7608346Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:07.7608995Z ^ 2025-05-07T19:55:07.7609223Z 2025-05-07T19:55:07.7610612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7612663Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7613191Z ^ 2025-05-07T19:55:07.7613471Z 2025-05-07T19:55:07.7615008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7617072Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7617607Z ^ 2025-05-07T19:55:07.7617877Z 2025-05-07T19:55:07.7619341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7621227Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7621764Z ^ 2025-05-07T19:55:07.7622035Z 2025-05-07T19:55:07.7623704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:07.7626338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7627528Z ^ 2025-05-07T19:55:07.7627788Z 2025-05-07T19:55:07.7628232Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:07.7628910Z 2025-05-07T19:55:07.7630559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7633169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7634345Z ^ 2025-05-07T19:55:07.7634704Z 2025-05-07T19:55:07.7636199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7638252Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:07.7639036Z ^ 2025-05-07T19:55:07.7639337Z 2025-05-07T19:55:07.7640868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7642769Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7643334Z ^ 2025-05-07T19:55:07.7643619Z 2025-05-07T19:55:07.7645136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7647089Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7647641Z ^ 
2025-05-07T19:55:07.7647923Z 2025-05-07T19:55:07.7649584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7651533Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7652088Z ^ 2025-05-07T19:55:07.7652401Z 2025-05-07T19:55:07.7654187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7657040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7658211Z ^ 2025-05-07T19:55:07.7658493Z 2025-05-07T19:55:07.7658957Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:07.7659622Z 2025-05-07T19:55:07.7661282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7663910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7665089Z ^ 2025-05-07T19:55:07.7665453Z 2025-05-07T19:55:07.7667010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7669105Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:07.7669877Z ^ 2025-05-07T19:55:07.7670386Z 2025-05-07T19:55:07.7671942Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7673922Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7674498Z ^ 2025-05-07T19:55:07.7674792Z 2025-05-07T19:55:07.7676324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7678237Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7678799Z ^ 2025-05-07T19:55:07.7679103Z 2025-05-07T19:55:07.7680630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7682501Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7683057Z ^ 2025-05-07T19:55:07.7683362Z 2025-05-07T19:55:07.7685009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7687646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7688807Z ^ 2025-05-07T19:55:07.7689064Z 2025-05-07T19:55:07.7689542Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:07.7690216Z 2025-05-07T19:55:07.7691880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.7694848Z
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:07.7696052Z ^ 2025-05-07T19:55:07.7696410Z 2025-05-07T19:55:07.7698052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7700399Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:07.7701167Z ^ 2025-05-07T19:55:07.7701458Z 2025-05-07T19:55:07.7703025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7704988Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7705556Z ^ 2025-05-07T19:55:07.7705870Z 2025-05-07T19:55:07.7707387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7709321Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7709890Z ^ 2025-05-07T19:55:07.7710176Z 2025-05-07T19:55:07.7711728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:07.7713672Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:07.7714254Z ^ 2025-05-07T19:55:07.7714536Z 2025-05-07T19:55:14.8982816Z [238/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:14.9004789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9007510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9008634Z ^ 2025-05-07T19:55:14.9008903Z 2025-05-07T19:55:14.9009356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9010030Z 2025-05-07T19:55:14.9011688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9014390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9015603Z ^ 2025-05-07T19:55:14.9015974Z 2025-05-07T19:55:14.9017786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9020391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9021438Z ^ 
2025-05-07T19:55:14.9021689Z 2025-05-07T19:55:14.9022163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9022813Z 2025-05-07T19:55:14.9024394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9026788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9027870Z ^ 2025-05-07T19:55:14.9028253Z 2025-05-07T19:55:14.9029750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9031990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9032972Z ^ 2025-05-07T19:55:14.9033225Z 2025-05-07T19:55:14.9033592Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9034189Z 2025-05-07T19:55:14.9035764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9038272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9039380Z ^ 2025-05-07T19:55:14.9039726Z 2025-05-07T19:55:14.9041324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:14.9044090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9045236Z ^ 2025-05-07T19:55:14.9045476Z 2025-05-07T19:55:14.9045858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9046486Z 2025-05-07T19:55:14.9048131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9050602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9051724Z ^ 2025-05-07T19:55:14.9052082Z 2025-05-07T19:55:14.9053565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9056149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9057427Z ^ 2025-05-07T19:55:14.9057702Z 2025-05-07T19:55:14.9058163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9058808Z 2025-05-07T19:55:14.9060369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9063042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9064230Z ^ 2025-05-07T19:55:14.9064598Z 2025-05-07T19:55:22.5324421Z [239/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:22.5347786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5350414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5351605Z ^ 2025-05-07T19:55:22.5351896Z 2025-05-07T19:55:22.5352347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5353018Z 2025-05-07T19:55:22.5354749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5357458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5358683Z ^ 2025-05-07T19:55:22.5359062Z 2025-05-07T19:55:22.5360658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:55:22.5362820Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5363573Z ^ 2025-05-07T19:55:22.5363865Z 2025-05-07T19:55:22.5365442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5367426Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5368002Z ^ 2025-05-07T19:55:22.5368294Z 2025-05-07T19:55:22.5369848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5372130Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5372699Z ^ 2025-05-07T19:55:22.5373008Z 2025-05-07T19:55:22.5374567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5376683Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5377240Z ^ 2025-05-07T19:55:22.5377546Z 2025-05-07T19:55:22.5379130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5381664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5383008Z ^ 2025-05-07T19:55:22.5383219Z 2025-05-07T19:55:22.5383591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5384133Z 2025-05-07T19:55:22.5385696Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5388587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5389769Z ^ 2025-05-07T19:55:22.5390128Z 2025-05-07T19:55:22.5391657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5393798Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5394571Z ^ 2025-05-07T19:55:22.5394845Z 2025-05-07T19:55:22.5396365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5398323Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5398863Z ^ 2025-05-07T19:55:22.5399160Z 2025-05-07T19:55:22.5400680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5402614Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5403155Z ^ 2025-05-07T19:55:22.5403460Z 2025-05-07T19:55:22.5404841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5406567Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5407120Z ^ 
2025-05-07T19:55:22.5407389Z 2025-05-07T19:55:22.5409061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5411715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5412907Z ^ 2025-05-07T19:55:22.5413162Z 2025-05-07T19:55:22.5413649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5414325Z 2025-05-07T19:55:22.5415992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5418861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5420055Z ^ 2025-05-07T19:55:22.5420461Z 2025-05-07T19:55:22.5422035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5424176Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5425122Z ^ 2025-05-07T19:55:22.5425439Z 2025-05-07T19:55:22.5426994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5428982Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5429546Z ^ 2025-05-07T19:55:22.5429826Z 2025-05-07T19:55:22.5431533Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5433436Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5433968Z ^ 2025-05-07T19:55:22.5434240Z 2025-05-07T19:55:22.5435691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5437582Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5438125Z ^ 2025-05-07T19:55:22.5438395Z 2025-05-07T19:55:22.5440019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5442431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5443568Z ^ 2025-05-07T19:55:22.5443814Z 2025-05-07T19:55:22.5444258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5444935Z 2025-05-07T19:55:22.5446541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5448881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5450062Z ^ 2025-05-07T19:55:22.5450419Z 2025-05-07T19:55:22.5451929Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5453942Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5454677Z ^ 2025-05-07T19:55:22.5454954Z 2025-05-07T19:55:22.5456688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5458631Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5459083Z ^ 2025-05-07T19:55:22.5459307Z 2025-05-07T19:55:22.5460450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5461915Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5462328Z ^ 2025-05-07T19:55:22.5462510Z 2025-05-07T19:55:22.5463613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5465110Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5465772Z ^ 2025-05-07T19:55:22.5465992Z 2025-05-07T19:55:22.5467267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5469365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5470504Z ^ 
2025-05-07T19:55:22.5470693Z 2025-05-07T19:55:22.5471240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5471726Z 2025-05-07T19:55:22.5472963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5475216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5476197Z ^ 2025-05-07T19:55:22.5476504Z 2025-05-07T19:55:22.5477817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5479603Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5480242Z ^ 2025-05-07T19:55:22.5480480Z 2025-05-07T19:55:22.5481719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5483366Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5483847Z ^ 2025-05-07T19:55:22.5484072Z 2025-05-07T19:55:22.5485329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5486858Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5487302Z ^ 2025-05-07T19:55:22.5487529Z 2025-05-07T19:55:22.5488875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5490659Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5491173Z ^ 2025-05-07T19:55:22.5491443Z 2025-05-07T19:55:22.5786409Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:22.5810804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5813598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5814844Z ^ 2025-05-07T19:55:22.5815123Z 2025-05-07T19:55:22.5815587Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5816258Z 2025-05-07T19:55:22.5817959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5820609Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5821881Z ^ 2025-05-07T19:55:22.5822371Z 2025-05-07T19:55:22.5823957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5826157Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5826943Z ^ 2025-05-07T19:55:22.5827245Z 2025-05-07T19:55:22.5828858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5830803Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5831284Z ^ 2025-05-07T19:55:22.5831538Z 2025-05-07T19:55:22.5832894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5834559Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5835066Z ^ 2025-05-07T19:55:22.5835340Z 2025-05-07T19:55:22.5836826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5838998Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5839569Z ^ 2025-05-07T19:55:22.5839854Z 2025-05-07T19:55:22.5841474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:22.5844219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5845394Z ^ 2025-05-07T19:55:22.5845649Z 2025-05-07T19:55:22.5846087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5846735Z 2025-05-07T19:55:22.5848372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5851017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5852215Z ^ 2025-05-07T19:55:22.5852597Z 2025-05-07T19:55:22.5854075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5856109Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5856999Z ^ 2025-05-07T19:55:22.5857301Z 2025-05-07T19:55:22.5858888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5860761Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5861336Z ^ 2025-05-07T19:55:22.5861617Z 2025-05-07T19:55:22.5863055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5865019Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5865561Z ^ 
2025-05-07T19:55:22.5865835Z 2025-05-07T19:55:22.5867265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5869103Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5869622Z ^ 2025-05-07T19:55:22.5869910Z 2025-05-07T19:55:22.5871731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5874283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5875391Z ^ 2025-05-07T19:55:22.5875675Z 2025-05-07T19:55:22.5876088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5876722Z 2025-05-07T19:55:22.5878193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5881082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5882274Z ^ 2025-05-07T19:55:22.5882649Z 2025-05-07T19:55:22.5884121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5886453Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5887243Z ^ 2025-05-07T19:55:22.5887534Z 2025-05-07T19:55:22.5889078Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5891060Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5891654Z ^ 2025-05-07T19:55:22.5891951Z 2025-05-07T19:55:22.5893403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5895272Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5895802Z ^ 2025-05-07T19:55:22.5896100Z 2025-05-07T19:55:22.5897679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5899685Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5900244Z ^ 2025-05-07T19:55:22.5900520Z 2025-05-07T19:55:22.5902206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5904858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5905954Z ^ 2025-05-07T19:55:22.5906191Z 2025-05-07T19:55:22.5906600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5907225Z 2025-05-07T19:55:22.5908780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5911499Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5912712Z ^ 2025-05-07T19:55:22.5913086Z 2025-05-07T19:55:22.5914629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5916615Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5917318Z ^ 2025-05-07T19:55:22.5917579Z 2025-05-07T19:55:22.5919061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5921028Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5921574Z ^ 2025-05-07T19:55:22.5921860Z 2025-05-07T19:55:22.5923344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5925270Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5925762Z ^ 2025-05-07T19:55:22.5926005Z 2025-05-07T19:55:22.5927390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5929336Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5929834Z ^ 2025-05-07T19:55:22.5930116Z 2025-05-07T19:55:22.5931657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:22.5934188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5935248Z ^ 2025-05-07T19:55:22.5935520Z 2025-05-07T19:55:22.5935924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.5936715Z 2025-05-07T19:55:22.5938297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.5940675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.5941809Z ^ 2025-05-07T19:55:22.5942089Z 2025-05-07T19:55:22.5943578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5945762Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.5946530Z ^ 2025-05-07T19:55:22.5946823Z 2025-05-07T19:55:22.5948390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5950377Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5950948Z ^ 2025-05-07T19:55:22.5951236Z 2025-05-07T19:55:22.5952782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5954689Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5955223Z ^ 
2025-05-07T19:55:22.5955513Z 2025-05-07T19:55:22.5956873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.5958700Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.5959223Z ^ 2025-05-07T19:55:22.5959479Z 2025-05-07T19:55:39.1491354Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:39.1514454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1517038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1518275Z ^ 2025-05-07T19:55:39.1518546Z 2025-05-07T19:55:39.1518988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.1519637Z 2025-05-07T19:55:39.1521268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:39.1523859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1524973Z ^ 2025-05-07T19:55:39.1525323Z 2025-05-07T19:55:39.1526873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1528799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1529356Z ^ 2025-05-07T19:55:39.1529656Z 2025-05-07T19:55:39.1531179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1533318Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1533876Z ^ 2025-05-07T19:55:39.1534162Z 2025-05-07T19:55:39.1535680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1537703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1538235Z ^ 2025-05-07T19:55:39.1538539Z 2025-05-07T19:55:39.1540264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1542846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1543972Z ^ 2025-05-07T19:55:39.1544231Z 2025-05-07T19:55:39.1544658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.1545296Z 
2025-05-07T19:55:39.1546915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1549491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1550875Z ^ 2025-05-07T19:55:39.1551228Z 2025-05-07T19:55:39.1552758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1554687Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1555245Z ^ 2025-05-07T19:55:39.1555534Z 2025-05-07T19:55:39.1557174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1559098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1559629Z ^ 2025-05-07T19:55:39.1559937Z 2025-05-07T19:55:39.1561474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1563353Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1563889Z ^ 2025-05-07T19:55:39.1564185Z 2025-05-07T19:55:39.1565775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1568363Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1569477Z ^ 2025-05-07T19:55:39.1569732Z 2025-05-07T19:55:39.1570362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.1571007Z 2025-05-07T19:55:39.1572620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1575175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1576791Z ^ 2025-05-07T19:55:39.1577145Z 2025-05-07T19:55:39.1578662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1580583Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1581289Z ^ 2025-05-07T19:55:39.1581579Z 2025-05-07T19:55:39.1583293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1585324Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1585859Z ^ 2025-05-07T19:55:39.1586158Z 2025-05-07T19:55:39.1587640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1589561Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1590106Z ^ 2025-05-07T19:55:39.1590412Z 2025-05-07T19:55:39.1592005Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1594570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1595695Z ^ 2025-05-07T19:55:39.1595934Z 2025-05-07T19:55:39.1596377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.1597020Z 2025-05-07T19:55:39.1598634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1601168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1602303Z ^ 2025-05-07T19:55:39.1602652Z 2025-05-07T19:55:39.1604188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1606121Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1606636Z ^ 2025-05-07T19:55:39.1606937Z 2025-05-07T19:55:39.1608454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1610384Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1610924Z ^ 2025-05-07T19:55:39.1611225Z 2025-05-07T19:55:39.1612743Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1614673Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1615380Z ^ 2025-05-07T19:55:39.1615672Z 2025-05-07T19:55:39.1617393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1620126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1621261Z ^ 2025-05-07T19:55:39.1621504Z 2025-05-07T19:55:39.1621943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.1622585Z 2025-05-07T19:55:39.1624322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.1626868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.1627996Z ^ 2025-05-07T19:55:39.1628347Z 2025-05-07T19:55:39.1629877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1631799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1632340Z ^ 2025-05-07T19:55:39.1632640Z 2025-05-07T19:55:39.1634160Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1636082Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1636617Z ^ 2025-05-07T19:55:39.1636917Z 2025-05-07T19:55:39.1638406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.1640498Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.1641114Z ^ 2025-05-07T19:55:39.1641411Z 2025-05-07T19:55:42.1288618Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:42.1310264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1312808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1314042Z ^ 2025-05-07T19:55:42.1314289Z 2025-05-07T19:55:42.1314729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:42.1315378Z 2025-05-07T19:55:42.1316957Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1319336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1320485Z ^ 2025-05-07T19:55:42.1320847Z 2025-05-07T19:55:42.1322335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1324876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1325970Z ^ 2025-05-07T19:55:42.1326210Z 2025-05-07T19:55:42.1326619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:42.1327241Z 2025-05-07T19:55:42.1328784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1331165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1332208Z ^ 2025-05-07T19:55:42.1332556Z 2025-05-07T19:55:42.1334185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1336801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1337887Z ^ 
2025-05-07T19:55:42.1338125Z 2025-05-07T19:55:42.1338550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:42.1339153Z 2025-05-07T19:55:42.1340726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1343294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1344726Z ^ 2025-05-07T19:55:42.1345081Z 2025-05-07T19:55:42.1346604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1349157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1350277Z ^ 2025-05-07T19:55:42.1350525Z 2025-05-07T19:55:42.1351116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:42.1351739Z 2025-05-07T19:55:42.1353329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1355759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1356899Z ^ 2025-05-07T19:55:42.1357258Z 2025-05-07T19:55:42.1358927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:42.1361433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1362596Z ^ 2025-05-07T19:55:42.1362844Z 2025-05-07T19:55:42.1363278Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:42.1363907Z 2025-05-07T19:55:42.1365417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.1367903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:42.1369078Z ^ 2025-05-07T19:55:42.1369446Z 2025-05-07T19:55:51.4930558Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:51.4955470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4958170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4959342Z ^ 2025-05-07T19:55:51.4959597Z 2025-05-07T19:55:51.4960044Z Remark: The warnings can 
be suppressed with "-diag-suppress " 2025-05-07T19:55:51.4960731Z 2025-05-07T19:55:51.4962417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4965118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4966342Z ^ 2025-05-07T19:55:51.4966701Z 2025-05-07T19:55:51.4968365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4971257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4972426Z ^ 2025-05-07T19:55:51.4972679Z 2025-05-07T19:55:51.4973133Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:51.4973815Z 2025-05-07T19:55:51.4975536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4978301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4979487Z ^ 2025-05-07T19:55:51.4979870Z 2025-05-07T19:55:51.4981511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4984204Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4985365Z ^ 2025-05-07T19:55:51.4985627Z 2025-05-07T19:55:51.4986072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:51.4986739Z 2025-05-07T19:55:51.4988411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4991455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4992652Z ^ 2025-05-07T19:55:51.4993015Z 2025-05-07T19:55:51.4994843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.4997491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.4998663Z ^ 2025-05-07T19:55:51.4998912Z 2025-05-07T19:55:51.4999352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:51.5000028Z 2025-05-07T19:55:51.5001687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.5004377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.5005550Z ^ 2025-05-07T19:55:51.5005934Z 2025-05-07T19:55:51.5007591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.5010266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.5011412Z ^ 2025-05-07T19:55:51.5011678Z 2025-05-07T19:55:51.5012123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:51.5012787Z 2025-05-07T19:55:51.5014454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:51.5017282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:51.5018470Z ^ 2025-05-07T19:55:51.5018831Z 2025-05-07T19:55:52.0182554Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:52.0205823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0208510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:55:52.0209694Z ^ 2025-05-07T19:55:52.0209953Z 2025-05-07T19:55:52.0210400Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0211081Z 2025-05-07T19:55:52.0212752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0215439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0216737Z ^ 2025-05-07T19:55:52.0217117Z 2025-05-07T19:55:52.0218391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0220391Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0221280Z ^ 2025-05-07T19:55:52.0224694Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:52.0227800Z 2025-05-07T19:55:52.0229103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0231090Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0232001Z ^ 2025-05-07T19:55:52.0235384Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const 
int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:52.0238682Z 2025-05-07T19:55:52.0239989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0242069Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0242973Z ^ 2025-05-07T19:55:52.0246370Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:52.0249497Z 2025-05-07T19:55:52.0250798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0252770Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0253679Z ^ 2025-05-07T19:55:52.0257167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:52.0260267Z 2025-05-07T19:55:52.0261549Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0263490Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0264377Z ^ 2025-05-07T19:55:52.0267668Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:52.0270943Z 2025-05-07T19:55:52.0272214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0274136Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0275017Z ^ 2025-05-07T19:55:52.0278323Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:52.0281687Z 2025-05-07T19:55:52.0282946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0284882Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0285764Z ^ 2025-05-07T19:55:52.0289241Z detected during 
instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:52.0292332Z 2025-05-07T19:55:52.0293595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0295545Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0296512Z ^ 2025-05-07T19:55:52.0299831Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:52.0302934Z 2025-05-07T19:55:52.0304195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0306135Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0307007Z ^ 2025-05-07T19:55:52.0310323Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, 
output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:52.0313410Z 2025-05-07T19:55:52.0314683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0316631Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0317497Z ^ 2025-05-07T19:55:52.0320834Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:52.0324058Z 2025-05-07T19:55:52.0325389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0327328Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0328212Z ^ 2025-05-07T19:55:52.0331675Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:52.0334778Z 2025-05-07T19:55:52.0336056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:52.0338091Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0338979Z ^ 2025-05-07T19:55:52.0358360Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:52.0361543Z 2025-05-07T19:55:52.0362855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0364813Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0365711Z ^ 2025-05-07T19:55:52.0369092Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:52.0372389Z 2025-05-07T19:55:52.0373653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0375623Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0376616Z ^ 2025-05-07T19:55:52.0379966Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:52.0383052Z 2025-05-07T19:55:52.0384320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0386701Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0387596Z ^ 2025-05-07T19:55:52.0391166Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:52.0394277Z 2025-05-07T19:55:52.0395545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0397511Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0398386Z ^ 2025-05-07T19:55:52.0401736Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:52.0404848Z 2025-05-07T19:55:52.0406115Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0408075Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0408945Z ^ 2025-05-07T19:55:52.0412306Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:52.0415425Z 2025-05-07T19:55:52.0416806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0418759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0419627Z ^ 2025-05-07T19:55:52.0423006Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:52.0426119Z 2025-05-07T19:55:52.0427382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0429445Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0430389Z ^ 2025-05-07T19:55:52.0433892Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:52.0437027Z 2025-05-07T19:55:52.0438307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0440240Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0441134Z ^ 2025-05-07T19:55:52.0444524Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:52.0447656Z 2025-05-07T19:55:52.0448930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0450861Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0451755Z ^ 2025-05-07T19:55:52.0455095Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:52.0458314Z 2025-05-07T19:55:52.0459568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0461508Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0462403Z ^ 2025-05-07T19:55:52.0465755Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:52.0468911Z 2025-05-07T19:55:52.0470394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0472353Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0473466Z ^ 2025-05-07T19:55:52.0476916Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:52.0480052Z 2025-05-07T19:55:52.0481462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:52.0483422Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0484303Z ^ 2025-05-07T19:55:52.0487697Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:52.0490839Z 2025-05-07T19:55:52.0492471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0495111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0496255Z ^ 2025-05-07T19:55:52.0496638Z 2025-05-07T19:55:52.0497078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0497727Z 2025-05-07T19:55:52.0499355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0501984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0503152Z ^ 2025-05-07T19:55:52.0503513Z 2025-05-07T19:55:52.0504792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0506725Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:55:52.0507623Z ^ 2025-05-07T19:55:52.0510963Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:52.0514057Z 2025-05-07T19:55:52.0515316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0517266Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0518319Z ^ 2025-05-07T19:55:52.0521739Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:52.0524835Z 2025-05-07T19:55:52.0526099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0528048Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0528934Z ^ 2025-05-07T19:55:52.0532274Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:52.0535406Z 2025-05-07T19:55:52.0536788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0538740Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0539613Z ^ 2025-05-07T19:55:52.0542923Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:52.0546028Z 2025-05-07T19:55:52.0547296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0549241Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0550107Z ^ 2025-05-07T19:55:52.0553440Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:52.0556533Z 2025-05-07T19:55:52.0557799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0559734Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0560625Z ^ 2025-05-07T19:55:52.0564128Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:52.0567204Z 2025-05-07T19:55:52.0568576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0570680Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0571568Z ^ 2025-05-07T19:55:52.0574899Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:52.0578080Z 2025-05-07T19:55:52.0579361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0581292Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0582175Z ^ 2025-05-07T19:55:52.0585525Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, 
const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:52.0588629Z 2025-05-07T19:55:52.0589920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0591905Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0592823Z ^ 2025-05-07T19:55:52.0596187Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:52.0599308Z 2025-05-07T19:55:52.0600586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0602561Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0603472Z ^ 2025-05-07T19:55:52.0606849Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:52.0610252Z 
2025-05-07T19:55:52.0611696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0613650Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0614544Z ^ 2025-05-07T19:55:52.0618002Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:52.0621106Z 2025-05-07T19:55:52.0622382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0624328Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0625218Z ^ 2025-05-07T19:55:52.0628596Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:52.0631704Z 2025-05-07T19:55:52.0632979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0634911Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:55:52.0635799Z ^ 2025-05-07T19:55:52.0639125Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:52.0642208Z 2025-05-07T19:55:52.0643466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0645420Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0646307Z ^ 2025-05-07T19:55:52.0649634Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:52.0652894Z 2025-05-07T19:55:52.0654156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0656208Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0657210Z ^ 2025-05-07T19:55:52.0660522Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:52.0663631Z 2025-05-07T19:55:52.0664892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0666838Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0667709Z ^ 2025-05-07T19:55:52.0671304Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:52.0674491Z 2025-05-07T19:55:52.0675770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0677762Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0678648Z ^ 2025-05-07T19:55:52.0682079Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:52.0685286Z 2025-05-07T19:55:52.0686570Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0688561Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0689453Z ^ 2025-05-07T19:55:52.0692917Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:52.0696554Z 2025-05-07T19:55:52.0697852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0699812Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0700898Z ^ 2025-05-07T19:55:52.0704358Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:52.0707570Z 2025-05-07T19:55:52.0708879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0710866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0711770Z ^ 
2025-05-07T19:55:52.0715238Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:52.0718451Z 2025-05-07T19:55:52.0719747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0721722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0722630Z ^ 2025-05-07T19:55:52.0726084Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:52.0729255Z 2025-05-07T19:55:52.0730542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0732531Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0733440Z ^ 2025-05-07T19:55:52.0737017Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:52.0740372Z 2025-05-07T19:55:52.0741654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0743637Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0744532Z ^ 2025-05-07T19:55:52.0748082Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:52.0751291Z 2025-05-07T19:55:52.0752583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0754566Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0755451Z ^ 2025-05-07T19:55:52.0758931Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:52.0762155Z 2025-05-07T19:55:52.0763807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0766476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0767632Z ^ 2025-05-07T19:55:52.0767890Z 2025-05-07T19:55:52.0768341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.0769003Z 2025-05-07T19:55:52.0770889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.0773549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.0774745Z ^ 2025-05-07T19:55:52.0775105Z 2025-05-07T19:55:52.0776466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0778404Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0779294Z ^ 2025-05-07T19:55:52.0782627Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:52.0785993Z 2025-05-07T19:55:52.0787253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:52.0789193Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0790227Z ^ 2025-05-07T19:55:52.0793548Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:52.0796657Z 2025-05-07T19:55:52.0797918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0799863Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0800733Z ^ 2025-05-07T19:55:52.0804067Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:52.0807160Z 2025-05-07T19:55:52.0808423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0810367Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0811238Z ^ 2025-05-07T19:55:52.0814593Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:52.0817829Z 2025-05-07T19:55:52.0819084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0821028Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0821898Z ^ 2025-05-07T19:55:52.0825211Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:52.0828441Z 2025-05-07T19:55:52.0829706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0831639Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0832524Z ^ 2025-05-07T19:55:52.0835969Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:52.0839061Z 2025-05-07T19:55:52.0840347Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0842287Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0843178Z ^ 2025-05-07T19:55:52.0846520Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:52.0849609Z 2025-05-07T19:55:52.0850888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0852830Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0853718Z ^ 2025-05-07T19:55:52.0857222Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:52.0860361Z 2025-05-07T19:55:52.0861641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0863641Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0864548Z ^ 2025-05-07T19:55:52.0867953Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:52.0871271Z 2025-05-07T19:55:52.0872525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0874692Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0875578Z ^ 2025-05-07T19:55:52.0879044Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:52.0882176Z 2025-05-07T19:55:52.0883436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0885391Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0886263Z ^ 2025-05-07T19:55:52.0889613Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:52.0892708Z 2025-05-07T19:55:52.0893968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0895916Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0896857Z ^ 2025-05-07T19:55:52.0900189Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:52.0903314Z 2025-05-07T19:55:52.0904574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0906528Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0907391Z ^ 2025-05-07T19:55:52.0910722Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:52.0913803Z 2025-05-07T19:55:52.0915078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:55:52.0917133Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0918019Z ^ 2025-05-07T19:55:52.0921464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:52.0924574Z 2025-05-07T19:55:52.0925848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0927779Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0928667Z ^ 2025-05-07T19:55:52.0932007Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:52.0935115Z 2025-05-07T19:55:52.0936507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0938453Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0939338Z ^ 2025-05-07T19:55:52.0942686Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:52.0945791Z 2025-05-07T19:55:52.0947050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0949007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0949896Z ^ 2025-05-07T19:55:52.0953272Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:52.0956399Z 2025-05-07T19:55:52.0957667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0959623Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0960666Z ^ 2025-05-07T19:55:52.0964113Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:52.0967258Z 2025-05-07T19:55:52.0968518Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0970703Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0971583Z ^ 2025-05-07T19:55:52.0974967Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:52.0978209Z 2025-05-07T19:55:52.0979471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0981411Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0982290Z ^ 2025-05-07T19:55:52.0985686Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:52.0988832Z 2025-05-07T19:55:52.0990093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.0992041Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.0992912Z ^ 
2025-05-07T19:55:52.0996281Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:52.0999404Z 2025-05-07T19:55:52.1000682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1002618Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1003666Z ^ 2025-05-07T19:55:52.1007094Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:52.1010213Z 2025-05-07T19:55:52.1011612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1013550Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1014426Z ^ 2025-05-07T19:55:52.1017921Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:52.1021054Z 2025-05-07T19:55:52.1022324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1024264Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1025138Z ^ 2025-05-07T19:55:52.1028553Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:52.1031692Z 2025-05-07T19:55:52.1033332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.1035979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.1037153Z ^ 2025-05-07T19:55:52.1037401Z 2025-05-07T19:55:52.1037849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.1038504Z 2025-05-07T19:55:52.1040137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.1042772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.1043947Z ^ 2025-05-07T19:55:52.1044305Z 2025-05-07T19:55:52.1045567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1047506Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1048507Z ^ 2025-05-07T19:55:52.1051907Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:52.1054993Z 2025-05-07T19:55:52.1056262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1058329Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1059202Z ^ 2025-05-07T19:55:52.1062527Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:52.1065614Z 2025-05-07T19:55:52.1066877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1068835Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1069728Z ^ 2025-05-07T19:55:52.1073246Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:52.1076325Z 2025-05-07T19:55:52.1077605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1079529Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1080413Z ^ 2025-05-07T19:55:52.1083773Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:52.1086883Z 2025-05-07T19:55:52.1088169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1090099Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1090988Z ^ 2025-05-07T19:55:52.1094542Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:52.1097709Z 2025-05-07T19:55:52.1099209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1101130Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1102034Z ^ 2025-05-07T19:55:52.1105404Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:52.1108500Z 2025-05-07T19:55:52.1109779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1111757Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1112674Z ^ 2025-05-07T19:55:52.1115999Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:52.1119124Z 2025-05-07T19:55:52.1120405Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1122402Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1123313Z ^ 2025-05-07T19:55:52.1126670Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:52.1129813Z 2025-05-07T19:55:52.1131096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1133079Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1133968Z ^ 2025-05-07T19:55:52.1137438Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:52.1140670Z 2025-05-07T19:55:52.1142024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1144004Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1144891Z ^ 2025-05-07T19:55:52.1148280Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:52.1151416Z 2025-05-07T19:55:52.1152690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1154668Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1155561Z ^ 2025-05-07T19:55:52.1158946Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:52.1162069Z 2025-05-07T19:55:52.1163366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1165321Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1166237Z ^ 2025-05-07T19:55:52.1169630Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:52.1172944Z 2025-05-07T19:55:52.1174235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1176196Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1177191Z ^ 2025-05-07T19:55:52.1180574Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:52.1183872Z 2025-05-07T19:55:52.1185172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1187258Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1188177Z ^ 2025-05-07T19:55:52.1191534Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:52.1194663Z 2025-05-07T19:55:52.1195932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:55:52.1197916Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1198827Z ^ 2025-05-07T19:55:52.1202174Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:52.1205302Z 2025-05-07T19:55:52.1206575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1208549Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1209447Z ^ 2025-05-07T19:55:52.1212827Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:52.1215990Z 2025-05-07T19:55:52.1217381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1219357Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1220259Z ^ 2025-05-07T19:55:52.1223650Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:52.1226944Z 2025-05-07T19:55:52.1228217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1230179Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1232470Z ^ 2025-05-07T19:55:52.1235887Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:52.1239067Z 2025-05-07T19:55:52.1240343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1242315Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1243214Z ^ 2025-05-07T19:55:52.1246612Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:52.1249780Z 2025-05-07T19:55:52.1251071Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1253030Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1253943Z ^ 2025-05-07T19:55:52.1257508Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:52.1260649Z 2025-05-07T19:55:52.1261951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1263888Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1264806Z ^ 2025-05-07T19:55:52.1268215Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:52.1271681Z 2025-05-07T19:55:52.1272934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1274879Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1275764Z ^ 
2025-05-07T19:55:52.1279340Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:52.1282486Z 2025-05-07T19:55:52.1283742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1285679Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1286573Z ^ 2025-05-07T19:55:52.1289937Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:52.1293103Z 2025-05-07T19:55:52.1294367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1296320Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1297364Z ^ 2025-05-07T19:55:52.1300772Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:52.1303948Z 2025-05-07T19:55:52.1305571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.1308204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.1309348Z ^ 2025-05-07T19:55:52.1309610Z 2025-05-07T19:55:52.1310057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:52.1310712Z 2025-05-07T19:55:52.1312364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:52.1314991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:52.1316382Z ^ 2025-05-07T19:55:52.1316743Z 2025-05-07T19:55:52.1318002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1319956Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1320926Z ^ 2025-05-07T19:55:52.1324234Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, 
index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:52.1327306Z 2025-05-07T19:55:52.1328572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1330513Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1331399Z ^ 2025-05-07T19:55:52.1334725Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:52.1337935Z 2025-05-07T19:55:52.1339200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1341143Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1342008Z ^ 2025-05-07T19:55:52.1345367Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:52.1348489Z 2025-05-07T19:55:52.1349757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1351702Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1352573Z ^ 2025-05-07T19:55:52.1355929Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:52.1359161Z 2025-05-07T19:55:52.1360424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1362364Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1363239Z ^ 2025-05-07T19:55:52.1366662Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:52.1369753Z 2025-05-07T19:55:52.1371264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1373238Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1374142Z ^ 2025-05-07T19:55:52.1377590Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, 
const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:52.1380671Z 2025-05-07T19:55:52.1381943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1383871Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1384755Z ^ 2025-05-07T19:55:52.1388107Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:52.1391188Z 2025-05-07T19:55:52.1392464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1394404Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1395293Z ^ 2025-05-07T19:55:52.1398644Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:52.1401742Z 2025-05-07T19:55:52.1403010Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1405195Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1406081Z ^ 2025-05-07T19:55:52.1409521Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:52.1412623Z 2025-05-07T19:55:52.1413879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1415837Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1416841Z ^ 2025-05-07T19:55:52.1420181Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:52.1423275Z 2025-05-07T19:55:52.1424536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1426495Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1427372Z ^ 2025-05-07T19:55:52.1430734Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:52.1433857Z 2025-05-07T19:55:52.1435121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1437074Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1437944Z ^ 2025-05-07T19:55:52.1441312Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:52.1444430Z 2025-05-07T19:55:52.1445691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1447777Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1448649Z ^ 2025-05-07T19:55:52.1452087Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:52.1455177Z 2025-05-07T19:55:52.1456555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1458489Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1459374Z ^ 2025-05-07T19:55:52.1462713Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:52.1465809Z 2025-05-07T19:55:52.1467084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1469019Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1469903Z ^ 2025-05-07T19:55:52.1473460Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:52.1476550Z 2025-05-07T19:55:52.1477820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:55:52.1479758Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1480649Z ^ 2025-05-07T19:55:52.1484015Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:52.1487115Z 2025-05-07T19:55:52.1488385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1490337Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1491437Z ^ 2025-05-07T19:55:52.1494924Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:52.1498176Z 2025-05-07T19:55:52.1499434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1501378Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1502268Z ^ 2025-05-07T19:55:52.1505648Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:52.1508796Z 2025-05-07T19:55:52.1510061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1512006Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1512885Z ^ 2025-05-07T19:55:52.1516271Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:52.1519414Z 2025-05-07T19:55:52.1520678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1522627Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1523505Z ^ 2025-05-07T19:55:52.1526914Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:52.1530083Z 
2025-05-07T19:55:52.1531354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1533302Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1534252Z ^ 2025-05-07T19:55:52.1537756Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:52.1540876Z 2025-05-07T19:55:52.1542235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1544176Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1545062Z ^ 2025-05-07T19:55:52.1548451Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:52.1551588Z 2025-05-07T19:55:52.1552881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1554824Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:55:52.1555717Z ^ 2025-05-07T19:55:52.1559121Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:52.1562234Z 2025-05-07T19:55:52.1563524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:52.1565465Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:52.1566336Z ^ 2025-05-07T19:55:52.1569740Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:52.1573069Z 2025-05-07T19:55:54.7292273Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 
2025-05-07T19:55:54.7303804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7305136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7305747Z ^ 2025-05-07T19:55:54.7305887Z 2025-05-07T19:55:54.7306144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:54.7306485Z 2025-05-07T19:55:54.7307318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7308656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7309279Z ^ 2025-05-07T19:55:54.7309471Z 2025-05-07T19:55:54.7310333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7311669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7312272Z ^ 2025-05-07T19:55:54.7312408Z 2025-05-07T19:55:54.7312642Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:54.7312993Z 2025-05-07T19:55:54.7313818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7315280Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7315883Z ^ 2025-05-07T19:55:54.7316072Z 2025-05-07T19:55:54.7316902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7318305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7318909Z ^ 2025-05-07T19:55:54.7319044Z 2025-05-07T19:55:54.7319287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:54.7319629Z 2025-05-07T19:55:54.7320467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7321815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7322422Z ^ 2025-05-07T19:55:54.7322614Z 2025-05-07T19:55:54.7323435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7324764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7325346Z ^ 2025-05-07T19:55:54.7325494Z 2025-05-07T19:55:54.7325724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:54.7326064Z 2025-05-07T19:55:54.7326897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7328222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7328839Z ^ 2025-05-07T19:55:54.7329026Z 2025-05-07T19:55:54.7329854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7331172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7331770Z ^ 2025-05-07T19:55:54.7331903Z 2025-05-07T19:55:54.7332131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:54.7332480Z 2025-05-07T19:55:54.7333304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.7334647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:54.7335243Z ^ 2025-05-07T19:55:54.7335442Z 2025-05-07T19:56:02.7696119Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:02.7717632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7720149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7721261Z ^ 2025-05-07T19:56:02.7721518Z 2025-05-07T19:56:02.7721956Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.7722593Z 2025-05-07T19:56:02.7724146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7726532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7727606Z ^ 2025-05-07T19:56:02.7727954Z 2025-05-07T19:56:02.7729531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7731998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7733031Z ^ 2025-05-07T19:56:02.7733254Z 2025-05-07T19:56:02.7733887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.7734577Z 2025-05-07T19:56:02.7736096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7738792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7739927Z ^ 2025-05-07T19:56:02.7740482Z 2025-05-07T19:56:02.7741921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7744407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7745562Z ^ 2025-05-07T19:56:02.7745807Z 2025-05-07T19:56:02.7746245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.7746886Z 2025-05-07T19:56:02.7748333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7750717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7751851Z ^ 2025-05-07T19:56:02.7752213Z 2025-05-07T19:56:02.7753766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7756104Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7757092Z ^ 2025-05-07T19:56:02.7757345Z 2025-05-07T19:56:02.7757769Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:02.7758358Z 2025-05-07T19:56:02.7759824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7762249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7763385Z ^ 2025-05-07T19:56:02.7763734Z 2025-05-07T19:56:02.7765234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7767515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7768582Z ^ 2025-05-07T19:56:02.7768831Z 2025-05-07T19:56:02.7769238Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.7769859Z 2025-05-07T19:56:02.7771653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.7774190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.7775593Z ^ 2025-05-07T19:56:02.7776062Z 2025-05-07T19:56:09.2858364Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:09.2889121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2892394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2893812Z ^ 2025-05-07T19:56:09.2894097Z 2025-05-07T19:56:09.2894636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2895429Z 2025-05-07T19:56:09.2897610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2900885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2902296Z ^ 2025-05-07T19:56:09.2902725Z 2025-05-07T19:56:09.2904658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2907440Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2908110Z ^ 2025-05-07T19:56:09.2908488Z 2025-05-07T19:56:09.2910496Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2913187Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2913872Z ^ 2025-05-07T19:56:09.2914225Z 2025-05-07T19:56:09.2916162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2918615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2919276Z ^ 2025-05-07T19:56:09.2919657Z 2025-05-07T19:56:09.2921613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2924808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2926199Z ^ 2025-05-07T19:56:09.2926515Z 2025-05-07T19:56:09.2927044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2927828Z 2025-05-07T19:56:09.2929157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2932590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2934202Z ^ 2025-05-07T19:56:09.2934668Z 2025-05-07T19:56:09.2936800Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2939202Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2939840Z ^ 2025-05-07T19:56:09.2940170Z 2025-05-07T19:56:09.2942259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2944923Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2945650Z ^ 2025-05-07T19:56:09.2946013Z 2025-05-07T19:56:09.2948108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2950713Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2951393Z ^ 2025-05-07T19:56:09.2951778Z 2025-05-07T19:56:09.2953808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2957032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2958446Z ^ 2025-05-07T19:56:09.2959039Z 2025-05-07T19:56:09.2959573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2960362Z 2025-05-07T19:56:09.2962316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:09.2965588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2967195Z ^ 2025-05-07T19:56:09.2967653Z 2025-05-07T19:56:09.2969602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2972495Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2973204Z ^ 2025-05-07T19:56:09.2973556Z 2025-05-07T19:56:09.2975494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2978035Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2978682Z ^ 2025-05-07T19:56:09.2979084Z 2025-05-07T19:56:09.2981060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2983693Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2984421Z ^ 2025-05-07T19:56:09.2984819Z 2025-05-07T19:56:09.2986906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2990247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.2991587Z ^ 2025-05-07T19:56:09.2991878Z 2025-05-07T19:56:09.2992405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2993206Z 
2025-05-07T19:56:09.2995217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.2998523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.3000016Z ^ 2025-05-07T19:56:09.3000480Z 2025-05-07T19:56:09.3002541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3005176Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3005888Z ^ 2025-05-07T19:56:09.3006269Z 2025-05-07T19:56:09.3008409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3010841Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3011499Z ^ 2025-05-07T19:56:09.3011861Z 2025-05-07T19:56:09.3013838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3019139Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3019793Z ^ 2025-05-07T19:56:09.3020161Z 2025-05-07T19:56:09.3022201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.3025674Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.3027066Z ^ 2025-05-07T19:56:09.3027359Z 2025-05-07T19:56:09.3027885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.3028684Z 2025-05-07T19:56:09.3030772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:09.3034039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:09.3035460Z ^ 2025-05-07T19:56:09.3035880Z 2025-05-07T19:56:09.3037822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3040269Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3040919Z ^ 2025-05-07T19:56:09.3041282Z 2025-05-07T19:56:09.3043227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3045647Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3046303Z ^ 2025-05-07T19:56:09.3046672Z 2025-05-07T19:56:09.3048583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.3050982Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.3051647Z ^ 2025-05-07T19:56:09.3052001Z 2025-05-07T19:56:29.7832435Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:29.7856536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7859188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7860358Z ^ 2025-05-07T19:56:29.7860685Z 2025-05-07T19:56:29.7861128Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:29.7861803Z 2025-05-07T19:56:29.7863480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7866163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7867319Z ^ 2025-05-07T19:56:29.7867694Z 2025-05-07T19:56:29.7869237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7871582Z if (output_d >= 
0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:29.7872309Z ^ 2025-05-07T19:56:29.7872602Z 2025-05-07T19:56:29.7874150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7876110Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7876646Z ^ 2025-05-07T19:56:29.7876921Z 2025-05-07T19:56:29.7878472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7880392Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7880937Z ^ 2025-05-07T19:56:29.7881208Z 2025-05-07T19:56:29.7882766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7884971Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7885505Z ^ 2025-05-07T19:56:29.7885775Z 2025-05-07T19:56:29.7887421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7890090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7891422Z ^ 2025-05-07T19:56:29.7891668Z 2025-05-07T19:56:29.7892110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:29.7892782Z 2025-05-07T19:56:29.7894449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7897223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7898378Z ^ 2025-05-07T19:56:29.7898737Z 2025-05-07T19:56:29.7900309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7902427Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:29.7903174Z ^ 2025-05-07T19:56:29.7903457Z 2025-05-07T19:56:29.7905003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7906943Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7907492Z ^ 2025-05-07T19:56:29.7907771Z 2025-05-07T19:56:29.7909333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7911293Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7911835Z ^ 2025-05-07T19:56:29.7912106Z 2025-05-07T19:56:29.7913659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7915607Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7916146Z ^ 2025-05-07T19:56:29.7916412Z 2025-05-07T19:56:29.7918063Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7920748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7921903Z ^ 2025-05-07T19:56:29.7922168Z 2025-05-07T19:56:29.7922607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:29.7923270Z 2025-05-07T19:56:29.7924972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7927635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7929007Z ^ 2025-05-07T19:56:29.7929367Z 2025-05-07T19:56:29.7930943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7933055Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:29.7933796Z ^ 2025-05-07T19:56:29.7934078Z 2025-05-07T19:56:29.7935745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7937776Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7938320Z ^ 2025-05-07T19:56:29.7938598Z 2025-05-07T19:56:29.7940147Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7942160Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7942729Z ^ 2025-05-07T19:56:29.7943012Z 2025-05-07T19:56:29.7944566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7946538Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7947090Z ^ 2025-05-07T19:56:29.7947397Z 2025-05-07T19:56:29.7949052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7951756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7952923Z ^ 2025-05-07T19:56:29.7953207Z 2025-05-07T19:56:29.7953662Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:29.7954327Z 2025-05-07T19:56:29.7956034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7958706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7959930Z ^ 2025-05-07T19:56:29.7960302Z 2025-05-07T19:56:29.7961898Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7964049Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:29.7964839Z ^ 2025-05-07T19:56:29.7965128Z 2025-05-07T19:56:29.7966685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7968699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7969292Z ^ 2025-05-07T19:56:29.7969589Z 2025-05-07T19:56:29.7971371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7973590Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7974149Z ^ 2025-05-07T19:56:29.7974453Z 2025-05-07T19:56:29.7976015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7978045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.7978590Z ^ 2025-05-07T19:56:29.7978899Z 2025-05-07T19:56:29.7980739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7983421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7984591Z ^ 
2025-05-07T19:56:29.7984840Z 2025-05-07T19:56:29.7985314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:29.7985978Z 2025-05-07T19:56:29.7987639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.7990316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:29.7991520Z ^ 2025-05-07T19:56:29.7991887Z 2025-05-07T19:56:29.7993448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.7995612Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:29.7996394Z ^ 2025-05-07T19:56:29.7996678Z 2025-05-07T19:56:29.7998219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.8000203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.8000763Z ^ 2025-05-07T19:56:29.8001059Z 2025-05-07T19:56:29.8002620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.8004587Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.8005136Z ^ 2025-05-07T19:56:29.8005413Z 2025-05-07T19:56:29.8006989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:29.8008952Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:29.8009530Z ^ 2025-05-07T19:56:29.8009812Z 2025-05-07T19:56:31.9193858Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:31.9216188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9218909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9219936Z ^ 2025-05-07T19:56:31.9220207Z 2025-05-07T19:56:31.9220593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.9221214Z 2025-05-07T19:56:31.9222766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9225218Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9226426Z ^ 2025-05-07T19:56:31.9226757Z 2025-05-07T19:56:31.9228205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9230160Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.9230930Z ^ 2025-05-07T19:56:31.9231228Z 2025-05-07T19:56:31.9232784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9234617Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9235151Z ^ 2025-05-07T19:56:31.9235645Z 2025-05-07T19:56:31.9237055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9238763Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9239337Z ^ 2025-05-07T19:56:31.9239602Z 2025-05-07T19:56:31.9241112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9242878Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9243384Z ^ 2025-05-07T19:56:31.9243669Z 2025-05-07T19:56:31.9245183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:31.9247699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9249102Z ^ 2025-05-07T19:56:31.9249379Z 2025-05-07T19:56:31.9249833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.9250488Z 2025-05-07T19:56:31.9252048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9254425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9255497Z ^ 2025-05-07T19:56:31.9255818Z 2025-05-07T19:56:31.9257323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9259332Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.9260058Z ^ 2025-05-07T19:56:31.9260311Z 2025-05-07T19:56:31.9261758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9263580Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9264152Z ^ 2025-05-07T19:56:31.9264467Z 2025-05-07T19:56:31.9265973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9267826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9268353Z ^ 2025-05-07T19:56:31.9268646Z 
2025-05-07T19:56:31.9269894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9271920Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9272422Z ^ 2025-05-07T19:56:31.9272648Z 2025-05-07T19:56:31.9274170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9276831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9278295Z ^ 2025-05-07T19:56:31.9278533Z 2025-05-07T19:56:31.9278970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.9279568Z 2025-05-07T19:56:31.9281311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9284180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9285258Z ^ 2025-05-07T19:56:31.9285561Z 2025-05-07T19:56:31.9286882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9288763Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.9289473Z ^ 2025-05-07T19:56:31.9289751Z 2025-05-07T19:56:31.9291058Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9292756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9293269Z ^ 2025-05-07T19:56:31.9293537Z 2025-05-07T19:56:31.9294937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9296864Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9297368Z ^ 2025-05-07T19:56:31.9297645Z 2025-05-07T19:56:31.9299166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9301043Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9301559Z ^ 2025-05-07T19:56:31.9301801Z 2025-05-07T19:56:31.9303257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9305609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9306588Z ^ 2025-05-07T19:56:31.9306819Z 2025-05-07T19:56:31.9307219Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.9307835Z 2025-05-07T19:56:31.9309377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9312072Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9313202Z ^ 2025-05-07T19:56:31.9313599Z 2025-05-07T19:56:31.9315146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9317212Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.9317895Z ^ 2025-05-07T19:56:31.9318156Z 2025-05-07T19:56:31.9319713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9321422Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9334881Z ^ 2025-05-07T19:56:31.9335193Z 2025-05-07T19:56:31.9337144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9338977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9339437Z ^ 2025-05-07T19:56:31.9339726Z 2025-05-07T19:56:31.9341062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9342860Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9343397Z ^ 2025-05-07T19:56:31.9343640Z 2025-05-07T19:56:31.9345158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:31.9347695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9348886Z ^ 2025-05-07T19:56:31.9349154Z 2025-05-07T19:56:31.9349592Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.9350229Z 2025-05-07T19:56:31.9351743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9354121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:31.9355128Z ^ 2025-05-07T19:56:31.9355454Z 2025-05-07T19:56:31.9356856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9358811Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.9359507Z ^ 2025-05-07T19:56:31.9359774Z 2025-05-07T19:56:31.9361155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9363086Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9363645Z ^ 2025-05-07T19:56:31.9363927Z 2025-05-07T19:56:31.9365423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9367388Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9367823Z ^ 2025-05-07T19:56:31.9368081Z 
2025-05-07T19:56:31.9369403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.9371373Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.9371909Z ^ 2025-05-07T19:56:31.9372169Z 2025-05-07T19:56:34.0626403Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:34.0650138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0652856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0653979Z ^ 2025-05-07T19:56:34.0654238Z 2025-05-07T19:56:34.0654671Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:34.0655340Z 2025-05-07T19:56:34.0656981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0659516Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0660744Z ^ 2025-05-07T19:56:34.0661115Z 2025-05-07T19:56:34.0662588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0665058Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:34.0665784Z ^ 2025-05-07T19:56:34.0666069Z 2025-05-07T19:56:34.0667549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0669445Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0670001Z ^ 2025-05-07T19:56:34.0670535Z 2025-05-07T19:56:34.0672334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0674288Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0674824Z ^ 2025-05-07T19:56:34.0675107Z 2025-05-07T19:56:34.0676674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0678578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0679135Z ^ 2025-05-07T19:56:34.0679408Z 2025-05-07T19:56:34.0680943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:34.0683540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0684701Z ^ 2025-05-07T19:56:34.0684955Z 2025-05-07T19:56:34.0685415Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:34.0686075Z 2025-05-07T19:56:34.0687725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0690366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0691496Z ^ 2025-05-07T19:56:34.0691879Z 2025-05-07T19:56:34.0693446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0695488Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:34.0696212Z ^ 2025-05-07T19:56:34.0696641Z 2025-05-07T19:56:34.0698206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0700205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0700771Z ^ 2025-05-07T19:56:34.0701058Z 2025-05-07T19:56:34.0702648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0704623Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0705185Z ^ 
2025-05-07T19:56:34.0705463Z 2025-05-07T19:56:34.0706978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0709244Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0709798Z ^ 2025-05-07T19:56:34.0710076Z 2025-05-07T19:56:34.0711748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0714388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0715568Z ^ 2025-05-07T19:56:34.0715829Z 2025-05-07T19:56:34.0716280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:34.0716966Z 2025-05-07T19:56:34.0718668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0721366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0722462Z ^ 2025-05-07T19:56:34.0722772Z 2025-05-07T19:56:34.0724100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0726122Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:34.0726871Z ^ 2025-05-07T19:56:34.0727159Z 2025-05-07T19:56:34.0728700Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0730576Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0731134Z ^ 2025-05-07T19:56:34.0731417Z 2025-05-07T19:56:34.0732954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0734905Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0735460Z ^ 2025-05-07T19:56:34.0735748Z 2025-05-07T19:56:34.0737394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0739300Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0739896Z ^ 2025-05-07T19:56:34.0740191Z 2025-05-07T19:56:34.0741713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0744384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0745549Z ^ 2025-05-07T19:56:34.0745786Z 2025-05-07T19:56:34.0746257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:34.0746898Z 2025-05-07T19:56:34.0748508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0751221Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0752695Z ^ 2025-05-07T19:56:34.0753069Z 2025-05-07T19:56:34.0754682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0756856Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:34.0757620Z ^ 2025-05-07T19:56:34.0758029Z 2025-05-07T19:56:34.0759489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0761400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0761973Z ^ 2025-05-07T19:56:34.0762251Z 2025-05-07T19:56:34.0763709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0765703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0766223Z ^ 2025-05-07T19:56:34.0766467Z 2025-05-07T19:56:34.0767917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0769841Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0770599Z ^ 2025-05-07T19:56:34.0770885Z 2025-05-07T19:56:34.0772544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:34.0775193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0776477Z ^ 2025-05-07T19:56:34.0776731Z 2025-05-07T19:56:34.0777185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:34.0777819Z 2025-05-07T19:56:34.0779493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.0782134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:34.0783292Z ^ 2025-05-07T19:56:34.0783661Z 2025-05-07T19:56:34.0785205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0787350Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:34.0788114Z ^ 2025-05-07T19:56:34.0788400Z 2025-05-07T19:56:34.0789957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0791880Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0792442Z ^ 2025-05-07T19:56:34.0792701Z 2025-05-07T19:56:34.0794156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0796432Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0796976Z ^ 
2025-05-07T19:56:34.0797267Z 2025-05-07T19:56:34.0798816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:34.0800786Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:34.0801332Z ^ 2025-05-07T19:56:34.0801810Z 2025-05-07T19:56:35.6687857Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:35.6699712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6701071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6701662Z ^ 2025-05-07T19:56:35.6701814Z 2025-05-07T19:56:35.6702048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:35.6702387Z 2025-05-07T19:56:35.6703230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6704870Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6705482Z ^ 2025-05-07T19:56:35.6705673Z 2025-05-07T19:56:35.6706560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6707636Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:35.6708042Z ^ 2025-05-07T19:56:35.6708193Z 2025-05-07T19:56:35.6708945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6709914Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6710217Z ^ 2025-05-07T19:56:35.6710366Z 2025-05-07T19:56:35.6711126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6712092Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6712382Z ^ 2025-05-07T19:56:35.6712545Z 2025-05-07T19:56:35.6713298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6714270Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6714553Z ^ 2025-05-07T19:56:35.6714700Z 2025-05-07T19:56:35.6715529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:35.6716849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6717443Z ^ 2025-05-07T19:56:35.6717576Z 2025-05-07T19:56:35.6717822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:35.6718162Z 2025-05-07T19:56:35.6718980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6720320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6720936Z ^ 2025-05-07T19:56:35.6721125Z 2025-05-07T19:56:35.6721885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6722951Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:35.6723344Z ^ 2025-05-07T19:56:35.6723508Z 2025-05-07T19:56:35.6724269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6725236Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6725590Z ^ 2025-05-07T19:56:35.6725794Z 2025-05-07T19:56:35.6726552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6727521Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6727803Z ^ 
2025-05-07T19:56:35.6727949Z 2025-05-07T19:56:35.6728789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6729742Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6730044Z ^ 2025-05-07T19:56:35.6730190Z 2025-05-07T19:56:35.6731031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6732357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6732955Z ^ 2025-05-07T19:56:35.6733091Z 2025-05-07T19:56:35.6733321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:35.6733672Z 2025-05-07T19:56:35.6734500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6735836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6736582Z ^ 2025-05-07T19:56:35.6736789Z 2025-05-07T19:56:35.6737559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6738623Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:35.6739013Z ^ 2025-05-07T19:56:35.6739180Z 2025-05-07T19:56:35.6739936Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6740906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6741194Z ^ 2025-05-07T19:56:35.6741345Z 2025-05-07T19:56:35.6742123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6743086Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6743381Z ^ 2025-05-07T19:56:35.6743528Z 2025-05-07T19:56:35.6744296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6745249Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6745545Z ^ 2025-05-07T19:56:35.6745690Z 2025-05-07T19:56:35.6746505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6747842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6748533Z ^ 2025-05-07T19:56:35.6748667Z 2025-05-07T19:56:35.6748898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:35.6749246Z 2025-05-07T19:56:35.6750077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6751527Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6752128Z ^ 2025-05-07T19:56:35.6752318Z 2025-05-07T19:56:35.6753088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6754141Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:35.6754540Z ^ 2025-05-07T19:56:35.6754689Z 2025-05-07T19:56:35.6755455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6756407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6756709Z ^ 2025-05-07T19:56:35.6756859Z 2025-05-07T19:56:35.6757617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6758589Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6758888Z ^ 2025-05-07T19:56:35.6759035Z 2025-05-07T19:56:35.6759790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6760758Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6761039Z ^ 2025-05-07T19:56:35.6761195Z 2025-05-07T19:56:35.6762025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:35.6763354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6763937Z ^ 2025-05-07T19:56:35.6764084Z 2025-05-07T19:56:35.6764315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:35.6764659Z 2025-05-07T19:56:35.6765497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6766829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:35.6767437Z ^ 2025-05-07T19:56:35.6767625Z 2025-05-07T19:56:35.6768404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6769453Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:35.6769906Z ^ 2025-05-07T19:56:35.6770369Z 2025-05-07T19:56:35.6771140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6772116Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6772422Z ^ 2025-05-07T19:56:35.6772574Z 2025-05-07T19:56:35.6773472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6774454Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6774739Z ^ 
2025-05-07T19:56:35.6774901Z 2025-05-07T19:56:35.6775659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:35.6776736Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:35.6777021Z ^ 2025-05-07T19:56:35.6777181Z 2025-05-07T19:56:39.8032377Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:39.8055869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8060362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8061531Z ^ 2025-05-07T19:56:39.8061783Z 2025-05-07T19:56:39.8062228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.8062902Z 2025-05-07T19:56:39.8064721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8067410Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8068563Z ^ 2025-05-07T19:56:39.8068947Z 2025-05-07T19:56:39.8070827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8073464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8074625Z ^ 2025-05-07T19:56:39.8074871Z 2025-05-07T19:56:39.8075331Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.8076007Z 2025-05-07T19:56:39.8077682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8080365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8081564Z ^ 2025-05-07T19:56:39.8081926Z 2025-05-07T19:56:39.8083567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8086210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8087392Z ^ 2025-05-07T19:56:39.8087644Z 2025-05-07T19:56:39.8088076Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.8088755Z 2025-05-07T19:56:39.8090256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8092909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8094104Z ^ 2025-05-07T19:56:39.8094472Z 2025-05-07T19:56:39.8096149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8098940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8100135Z ^ 2025-05-07T19:56:39.8100387Z 2025-05-07T19:56:39.8100840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.8101505Z 2025-05-07T19:56:39.8103198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8106251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8107443Z ^ 2025-05-07T19:56:39.8107805Z 2025-05-07T19:56:39.8109633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8112339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8113513Z ^ 2025-05-07T19:56:39.8113781Z 2025-05-07T19:56:39.8114229Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:56:39.8114905Z 2025-05-07T19:56:39.8116567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.8119269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.8120463Z ^ 2025-05-07T19:56:39.8120831Z 2025-05-07T19:56:41.1941204Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:41.1959232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1961261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1962455Z ^ 2025-05-07T19:56:41.1962686Z 2025-05-07T19:56:41.1963033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1963557Z 2025-05-07T19:56:41.1965108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1967091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1968086Z ^ 2025-05-07T19:56:41.1968378Z 2025-05-07T19:56:41.1969757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1971971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1972763Z ^ 2025-05-07T19:56:41.1972936Z 2025-05-07T19:56:41.1973249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1973723Z 2025-05-07T19:56:41.1974910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1976883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1977762Z ^ 2025-05-07T19:56:41.1978054Z 2025-05-07T19:56:41.1979301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1981401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1982221Z ^ 2025-05-07T19:56:41.1982404Z 2025-05-07T19:56:41.1982755Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1983193Z 2025-05-07T19:56:41.1984267Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1985994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1986768Z ^ 2025-05-07T19:56:41.1987003Z 2025-05-07T19:56:41.1988061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1989763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1990847Z ^ 2025-05-07T19:56:41.1991050Z 2025-05-07T19:56:41.1991354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1991840Z 2025-05-07T19:56:41.1993208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.1995654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.1996815Z ^ 2025-05-07T19:56:41.1997162Z 2025-05-07T19:56:41.1998422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.2000549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.2001555Z ^ 
2025-05-07T19:56:41.2001800Z 2025-05-07T19:56:41.2002247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.2002845Z 2025-05-07T19:56:41.2004183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.2006050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:41.2006874Z ^ 2025-05-07T19:56:41.2007136Z 2025-05-07T19:56:42.9274360Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:42.9297990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9300627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9301775Z ^ 2025-05-07T19:56:42.9302014Z 2025-05-07T19:56:42.9302438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.9303101Z 2025-05-07T19:56:42.9304692Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9307147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9308048Z ^ 2025-05-07T19:56:42.9308373Z 2025-05-07T19:56:42.9309672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9311661Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9312640Z ^ 2025-05-07T19:56:42.9312883Z 2025-05-07T19:56:42.9313267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.9313835Z 2025-05-07T19:56:42.9315308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9317626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9318672Z ^ 2025-05-07T19:56:42.9319007Z 2025-05-07T19:56:42.9320498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9322859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9324007Z ^ 
2025-05-07T19:56:42.9324231Z 2025-05-07T19:56:42.9324637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.9325270Z 2025-05-07T19:56:42.9326758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9329186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9330245Z ^ 2025-05-07T19:56:42.9330601Z 2025-05-07T19:56:42.9332226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9335170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9336533Z ^ 2025-05-07T19:56:42.9336825Z 2025-05-07T19:56:42.9337283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.9337957Z 2025-05-07T19:56:42.9339812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9342482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9343707Z ^ 2025-05-07T19:56:42.9344078Z 2025-05-07T19:56:42.9345734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:42.9348431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9349642Z ^ 2025-05-07T19:56:42.9349899Z 2025-05-07T19:56:42.9350364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.9351063Z 2025-05-07T19:56:42.9352740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.9355455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.9356672Z ^ 2025-05-07T19:56:42.9357071Z 2025-05-07T19:56:45.3955498Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:45.3978597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.3981276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.3982456Z ^ 2025-05-07T19:56:45.3982712Z 
2025-05-07T19:56:45.3983191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.3983833Z 2025-05-07T19:56:45.3985473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.3988075Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.3989266Z ^ 2025-05-07T19:56:45.3989638Z 2025-05-07T19:56:45.3991274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.3993998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.3995169Z ^ 2025-05-07T19:56:45.3995458Z 2025-05-07T19:56:45.3995915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.3996585Z 2025-05-07T19:56:45.3998276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4000822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4002022Z ^ 2025-05-07T19:56:45.4002398Z 2025-05-07T19:56:45.4004023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4006629Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4007821Z ^ 2025-05-07T19:56:45.4008085Z 2025-05-07T19:56:45.4008560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.4009260Z 2025-05-07T19:56:45.4010904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4013884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4015168Z ^ 2025-05-07T19:56:45.4015566Z 2025-05-07T19:56:45.4017384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4020089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4021244Z ^ 2025-05-07T19:56:45.4021520Z 2025-05-07T19:56:45.4021959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.4022605Z 2025-05-07T19:56:45.4024164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4026648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4027828Z ^ 2025-05-07T19:56:45.4028180Z 2025-05-07T19:56:45.4029631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4032186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4033374Z ^ 2025-05-07T19:56:45.4033631Z 2025-05-07T19:56:45.4034076Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.4034720Z 2025-05-07T19:56:45.4036240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4038883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.4039965Z ^ 2025-05-07T19:56:45.4040341Z 2025-05-07T19:56:46.2901775Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:46.2924280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2926892Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2928102Z ^ 2025-05-07T19:56:46.2928360Z 2025-05-07T19:56:46.2928806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:46.2929465Z 2025-05-07T19:56:46.2931091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2933670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2934818Z ^ 2025-05-07T19:56:46.2935216Z 2025-05-07T19:56:46.2936961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2939510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2940633Z ^ 2025-05-07T19:56:46.2940922Z 2025-05-07T19:56:46.2941362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:46.2941959Z 2025-05-07T19:56:46.2943660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2946280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2947465Z ^ 2025-05-07T19:56:46.2947822Z 2025-05-07T19:56:46.2949456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2952066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2953212Z ^ 2025-05-07T19:56:46.2953468Z 2025-05-07T19:56:46.2953911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:46.2954730Z 2025-05-07T19:56:46.2956397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2959032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2960117Z ^ 2025-05-07T19:56:46.2960452Z 2025-05-07T19:56:46.2962209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2964750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2965893Z ^ 2025-05-07T19:56:46.2966182Z 2025-05-07T19:56:46.2966646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:46.2967279Z 2025-05-07T19:56:46.2968940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2971785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:56:46.2972968Z ^ 2025-05-07T19:56:46.2973319Z 2025-05-07T19:56:46.2974920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2977602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2978747Z ^ 2025-05-07T19:56:46.2978998Z 2025-05-07T19:56:46.2979408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:46.2980112Z 2025-05-07T19:56:46.2981709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.2984302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:46.2985480Z ^ 2025-05-07T19:56:46.2985881Z 2025-05-07T19:56:51.2263668Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:51.2286506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:56:51.2289217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2290330Z ^ 2025-05-07T19:56:51.2290613Z 2025-05-07T19:56:51.2291056Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.2291731Z 2025-05-07T19:56:51.2293362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2295895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2297156Z ^ 2025-05-07T19:56:51.2297517Z 2025-05-07T19:56:51.2299090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2301532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2302688Z ^ 2025-05-07T19:56:51.2302958Z 2025-05-07T19:56:51.2303379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.2304048Z 2025-05-07T19:56:51.2305602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2308167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2309314Z ^ 2025-05-07T19:56:51.2309679Z 2025-05-07T19:56:51.2311176Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2313709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2315197Z ^ 2025-05-07T19:56:51.2315446Z 2025-05-07T19:56:51.2315869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.2316474Z 2025-05-07T19:56:51.2317953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2320690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2321839Z ^ 2025-05-07T19:56:51.2322192Z 2025-05-07T19:56:51.2323783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2326293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2327451Z ^ 2025-05-07T19:56:51.2327691Z 2025-05-07T19:56:51.2328123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.2328761Z 2025-05-07T19:56:51.2330429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2332970Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2334110Z ^ 2025-05-07T19:56:51.2334422Z 2025-05-07T19:56:51.2336014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2338686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2339697Z ^ 2025-05-07T19:56:51.2339946Z 2025-05-07T19:56:51.2340325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.2340914Z 2025-05-07T19:56:51.2342511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.2345023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.2346145Z ^ 2025-05-07T19:56:51.2346475Z 2025-05-07T19:56:58.2306138Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:58.2328734Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2331245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2332407Z ^ 2025-05-07T19:56:58.2332675Z 2025-05-07T19:56:58.2333107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:58.2333757Z 2025-05-07T19:56:58.2335399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2338154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2339306Z ^ 2025-05-07T19:56:58.2339657Z 2025-05-07T19:56:58.2341278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2343875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2344992Z ^ 2025-05-07T19:56:58.2345244Z 2025-05-07T19:56:58.2345679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:58.2346329Z 2025-05-07T19:56:58.2348021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2350555Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2351573Z ^ 2025-05-07T19:56:58.2351907Z 2025-05-07T19:56:58.2353631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2356256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2357387Z ^ 2025-05-07T19:56:58.2357675Z 2025-05-07T19:56:58.2358116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:58.2358775Z 2025-05-07T19:56:58.2360534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2363120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2364312Z ^ 2025-05-07T19:56:58.2364676Z 2025-05-07T19:56:58.2366286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2368850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2370028Z ^ 2025-05-07T19:56:58.2370587Z 2025-05-07T19:56:58.2371015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:58.2371682Z 2025-05-07T19:56:58.2373296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2375922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2377155Z ^ 2025-05-07T19:56:58.2377532Z 2025-05-07T19:56:58.2379124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2381653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2382712Z ^ 2025-05-07T19:56:58.2382981Z 2025-05-07T19:56:58.2383412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:58.2384016Z 2025-05-07T19:56:58.2385497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.2388050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:58.2389209Z ^ 2025-05-07T19:56:58.2389565Z 2025-05-07T19:57:01.4950138Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:01.4972755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.4975190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.4976294Z ^ 2025-05-07T19:57:01.4976630Z 2025-05-07T19:57:01.4977023Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:01.4977694Z 2025-05-07T19:57:01.4979194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.4981602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.4982801Z ^ 2025-05-07T19:57:01.4983161Z 2025-05-07T19:57:01.4984637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.4987078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.4987968Z ^ 2025-05-07T19:57:01.4988216Z 2025-05-07T19:57:01.4988609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:01.4989251Z 2025-05-07T19:57:01.4990849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.4993654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.4994795Z ^ 2025-05-07T19:57:01.4995151Z 2025-05-07T19:57:01.4996710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.4999512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5000686Z ^ 2025-05-07T19:57:01.5000927Z 2025-05-07T19:57:01.5001330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:01.5002005Z 2025-05-07T19:57:01.5003593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.5006047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5007117Z ^ 2025-05-07T19:57:01.5007449Z 2025-05-07T19:57:01.5008964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.5011349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5012441Z ^ 2025-05-07T19:57:01.5012672Z 2025-05-07T19:57:01.5013105Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:01.5013713Z 2025-05-07T19:57:01.5015184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.5017736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5018817Z ^ 2025-05-07T19:57:01.5019167Z 2025-05-07T19:57:01.5020608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.5022914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5023922Z ^ 2025-05-07T19:57:01.5024159Z 2025-05-07T19:57:01.5024551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:01.5025127Z 2025-05-07T19:57:01.5026584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:01.5028925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:01.5029968Z ^ 2025-05-07T19:57:01.5030282Z 2025-05-07T19:57:03.3294596Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:03.3317587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3320178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3321330Z ^ 2025-05-07T19:57:03.3321581Z 2025-05-07T19:57:03.3322039Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3322709Z 2025-05-07T19:57:03.3324358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3326844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3328005Z ^ 2025-05-07T19:57:03.3328381Z 2025-05-07T19:57:03.3330080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3332695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3333760Z ^ 2025-05-07T19:57:03.3333996Z 2025-05-07T19:57:03.3334682Z Remark: 
The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3335344Z 2025-05-07T19:57:03.3337146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3339689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3340812Z ^ 2025-05-07T19:57:03.3341352Z 2025-05-07T19:57:03.3342997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3345645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3346761Z ^ 2025-05-07T19:57:03.3346999Z 2025-05-07T19:57:03.3347434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3348075Z 2025-05-07T19:57:03.3349697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3352251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3353373Z ^ 2025-05-07T19:57:03.3353711Z 2025-05-07T19:57:03.3355308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3357948Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3359125Z ^ 2025-05-07T19:57:03.3359398Z 2025-05-07T19:57:03.3359821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3360486Z 2025-05-07T19:57:03.3362039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3364636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3365817Z ^ 2025-05-07T19:57:03.3366169Z 2025-05-07T19:57:03.3367775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3370619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3371724Z ^ 2025-05-07T19:57:03.3371972Z 2025-05-07T19:57:03.3372357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3372970Z 2025-05-07T19:57:03.3374601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3377331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3378756Z ^ 2025-05-07T19:57:03.3379242Z 2025-05-07T19:57:03.7229695Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:03.7251010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7253015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7254020Z ^ 2025-05-07T19:57:03.7254218Z 2025-05-07T19:57:03.7254590Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.7255213Z 2025-05-07T19:57:03.7256944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7259498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7260597Z ^ 2025-05-07T19:57:03.7260953Z 2025-05-07T19:57:03.7262516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7265380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:03.7266533Z ^ 2025-05-07T19:57:03.7266756Z 2025-05-07T19:57:03.7267166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.7267839Z 2025-05-07T19:57:03.7269552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7272307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7273421Z ^ 2025-05-07T19:57:03.7273786Z 2025-05-07T19:57:03.7275320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7277805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7278869Z ^ 2025-05-07T19:57:03.7279116Z 2025-05-07T19:57:03.7279536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.7280163Z 2025-05-07T19:57:03.7281759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7284267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7285406Z ^ 2025-05-07T19:57:03.7285746Z 2025-05-07T19:57:03.7287257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:57:03.7289710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7290778Z ^ 2025-05-07T19:57:03.7291011Z 2025-05-07T19:57:03.7291431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.7292050Z 2025-05-07T19:57:03.7293606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7296060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7297251Z ^ 2025-05-07T19:57:03.7297605Z 2025-05-07T19:57:03.7299124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7301566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7302643Z ^ 2025-05-07T19:57:03.7302888Z 2025-05-07T19:57:03.7303304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.7303920Z 2025-05-07T19:57:03.7305473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.7308063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.7309174Z ^ 2025-05-07T19:57:03.7309533Z 2025-05-07T19:57:10.0484852Z [262/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:10.0508644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0511321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0512499Z ^ 2025-05-07T19:57:10.0512745Z 2025-05-07T19:57:10.0513196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.0513885Z 2025-05-07T19:57:10.0515525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0518117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0519640Z ^ 2025-05-07T19:57:10.0519999Z 2025-05-07T19:57:10.0521566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:10.0524188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0525308Z ^ 2025-05-07T19:57:10.0525737Z 2025-05-07T19:57:10.0526193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.0526860Z 2025-05-07T19:57:10.0528522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0531229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0532430Z ^ 2025-05-07T19:57:10.0532771Z 2025-05-07T19:57:10.0534390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0537137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0538342Z ^ 2025-05-07T19:57:10.0538598Z 2025-05-07T19:57:10.0539049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.0539746Z 2025-05-07T19:57:10.0541451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0544004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0545175Z ^ 2025-05-07T19:57:10.0545539Z 2025-05-07T19:57:10.0547099Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0549693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0550801Z ^ 2025-05-07T19:57:10.0551058Z 2025-05-07T19:57:10.0551485Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.0552146Z 2025-05-07T19:57:10.0553795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0556427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0557559Z ^ 2025-05-07T19:57:10.0557897Z 2025-05-07T19:57:10.0559534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0562151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0563554Z ^ 2025-05-07T19:57:10.0563803Z 2025-05-07T19:57:10.0564229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.0564855Z 2025-05-07T19:57:10.0566476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0569209Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.0570642Z ^ 2025-05-07T19:57:10.0570993Z 2025-05-07T19:57:10.3039026Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:10.3064930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3067817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3069109Z ^ 2025-05-07T19:57:10.3069390Z 2025-05-07T19:57:10.3069868Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.3070879Z 2025-05-07T19:57:10.3073023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3076027Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3077313Z ^ 2025-05-07T19:57:10.3077704Z 2025-05-07T19:57:10.3079700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3082478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3083762Z ^ 2025-05-07T19:57:10.3084038Z 2025-05-07T19:57:10.3084542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.3085261Z 2025-05-07T19:57:10.3087028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3089905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3091194Z ^ 2025-05-07T19:57:10.3091598Z 2025-05-07T19:57:10.3093275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3096148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3097389Z ^ 2025-05-07T19:57:10.3097683Z 2025-05-07T19:57:10.3098158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.3098873Z 2025-05-07T19:57:10.3100649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3103515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3104816Z ^ 2025-05-07T19:57:10.3105204Z 2025-05-07T19:57:10.3106967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3109823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3111091Z ^ 2025-05-07T19:57:10.3111366Z 2025-05-07T19:57:10.3111844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:10.3112574Z 2025-05-07T19:57:10.3114368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3117248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3118529Z ^ 2025-05-07T19:57:10.3118934Z 2025-05-07T19:57:10.3120697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3123816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3125075Z ^ 2025-05-07T19:57:10.3125362Z 2025-05-07T19:57:10.3125829Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:57:10.3126536Z 2025-05-07T19:57:10.3128493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.3131373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:10.3132676Z ^ 2025-05-07T19:57:10.3133071Z 2025-05-07T19:57:13.3977618Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:13.4001151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4003715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4005265Z ^ 2025-05-07T19:57:13.4005514Z 2025-05-07T19:57:13.4005949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.4006596Z 2025-05-07T19:57:13.4008255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4011211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4012423Z ^ 2025-05-07T19:57:13.4012789Z 2025-05-07T19:57:13.4014418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4017136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4018237Z ^ 2025-05-07T19:57:13.4018492Z 2025-05-07T19:57:13.4018862Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.4019501Z 2025-05-07T19:57:13.4021128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4023813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4024977Z ^ 2025-05-07T19:57:13.4025350Z 2025-05-07T19:57:13.4027012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4029694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4030820Z ^ 2025-05-07T19:57:13.4031073Z 2025-05-07T19:57:13.4031532Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:57:13.4032173Z 2025-05-07T19:57:13.4033818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4036527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4037722Z ^ 2025-05-07T19:57:13.4038086Z 2025-05-07T19:57:13.4039681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4042318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4043513Z ^ 2025-05-07T19:57:13.4043783Z 2025-05-07T19:57:13.4044232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.4044910Z 2025-05-07T19:57:13.4046626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4049565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4050784Z ^ 2025-05-07T19:57:13.4051149Z 2025-05-07T19:57:13.4052850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4055687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4056950Z ^ 2025-05-07T19:57:13.4057149Z 2025-05-07T19:57:13.4057572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.4058233Z 2025-05-07T19:57:13.4059873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.4062622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.4063822Z ^ 2025-05-07T19:57:13.4064204Z 2025-05-07T19:57:22.5601444Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:22.5623442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5625754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5626909Z ^ 2025-05-07T19:57:22.5627133Z 2025-05-07T19:57:22.5627992Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T19:57:22.5628596Z 2025-05-07T19:57:22.5630017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5632295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5633446Z ^ 2025-05-07T19:57:22.5633772Z 2025-05-07T19:57:22.5635216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5637636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5638752Z ^ 2025-05-07T19:57:22.5638987Z 2025-05-07T19:57:22.5639376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.5639936Z 2025-05-07T19:57:22.5641450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5643948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5645037Z ^ 2025-05-07T19:57:22.5645367Z 2025-05-07T19:57:22.5646894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5649313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5650411Z ^ 2025-05-07T19:57:22.5650651Z 2025-05-07T19:57:22.5651051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.5651680Z 2025-05-07T19:57:22.5653179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5655425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5656565Z ^ 2025-05-07T19:57:22.5656898Z 2025-05-07T19:57:22.5658372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5660574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5661486Z ^ 2025-05-07T19:57:22.5661748Z 2025-05-07T19:57:22.5662343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.5662986Z 2025-05-07T19:57:22.5664296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5666506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5667486Z ^ 2025-05-07T19:57:22.5667965Z 2025-05-07T19:57:22.5669371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5671935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5673046Z ^ 2025-05-07T19:57:22.5673290Z 2025-05-07T19:57:22.5673703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:22.5674298Z 2025-05-07T19:57:22.5675818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5678318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:22.5679396Z ^ 2025-05-07T19:57:22.5679763Z 2025-05-07T19:57:30.6928887Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:30.6952584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6955616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6956808Z ^ 2025-05-07T19:57:30.6957051Z 2025-05-07T19:57:30.6957502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.6958145Z 2025-05-07T19:57:30.6959814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6962489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6963682Z ^ 2025-05-07T19:57:30.6964051Z 2025-05-07T19:57:30.6965706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6968336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6969480Z ^ 2025-05-07T19:57:30.6969751Z 2025-05-07T19:57:30.6970448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.6971112Z 2025-05-07T19:57:30.6972740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6975421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6976702Z ^ 2025-05-07T19:57:30.6977066Z 2025-05-07T19:57:30.6978499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6980970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6982062Z ^ 2025-05-07T19:57:30.6982277Z 2025-05-07T19:57:30.6982696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.6983315Z 2025-05-07T19:57:30.6984840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6987227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6988324Z ^ 2025-05-07T19:57:30.6988681Z 2025-05-07T19:57:30.6990267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6993145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.6994267Z ^ 2025-05-07T19:57:30.6994521Z 2025-05-07T19:57:30.6994945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.6995611Z 2025-05-07T19:57:30.6997444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6999848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.7000842Z ^ 2025-05-07T19:57:30.7001163Z 2025-05-07T19:57:30.7002637Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.7005110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.7006182Z ^ 2025-05-07T19:57:30.7006432Z 2025-05-07T19:57:30.7006876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.7020501Z 2025-05-07T19:57:30.7022221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.7024861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.7026041Z ^ 2025-05-07T19:57:30.7026420Z 2025-05-07T19:57:32.1722935Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:32.1744496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:32.1746923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1748026Z ^ 2025-05-07T19:57:32.1748287Z 2025-05-07T19:57:32.1748712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:32.1749342Z 2025-05-07T19:57:32.1750946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1753470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1754624Z ^ 2025-05-07T19:57:32.1754982Z 2025-05-07T19:57:32.1756608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1759147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1760224Z ^ 2025-05-07T19:57:32.1760467Z 2025-05-07T19:57:32.1760845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:32.1761485Z 2025-05-07T19:57:32.1763132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1765721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1766882Z ^ 2025-05-07T19:57:32.1767238Z 2025-05-07T19:57:32.1768818Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1771661Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1772788Z ^ 2025-05-07T19:57:32.1773052Z 2025-05-07T19:57:32.1773486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:32.1774138Z 2025-05-07T19:57:32.1775783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1778487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1779965Z ^ 2025-05-07T19:57:32.1780303Z 2025-05-07T19:57:32.1781920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1784482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1785587Z ^ 2025-05-07T19:57:32.1785826Z 2025-05-07T19:57:32.1786451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:32.1787089Z 2025-05-07T19:57:32.1788675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1791277Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1792289Z ^ 2025-05-07T19:57:32.1792654Z 2025-05-07T19:57:32.1794233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1796788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1797909Z ^ 2025-05-07T19:57:32.1798171Z 2025-05-07T19:57:32.1798595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:32.1799224Z 2025-05-07T19:57:32.1800867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.1803411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:32.1804522Z ^ 2025-05-07T19:57:32.1804857Z 2025-05-07T19:57:35.0479526Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:35.0502917Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0505470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0506569Z ^ 2025-05-07T19:57:35.0506816Z 2025-05-07T19:57:35.0507237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.0507889Z 2025-05-07T19:57:35.0509490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0512009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0513149Z ^ 2025-05-07T19:57:35.0513568Z 2025-05-07T19:57:35.0515150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0517659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0518788Z ^ 2025-05-07T19:57:35.0519040Z 2025-05-07T19:57:35.0519476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.0520143Z 2025-05-07T19:57:35.0521784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0524442Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0525561Z ^ 2025-05-07T19:57:35.0525935Z 2025-05-07T19:57:35.0527518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0530147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0531291Z ^ 2025-05-07T19:57:35.0531550Z 2025-05-07T19:57:35.0531991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.0532636Z 2025-05-07T19:57:35.0534293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0537278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0538447Z ^ 2025-05-07T19:57:35.0538807Z 2025-05-07T19:57:35.0540626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0543241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0544341Z ^ 2025-05-07T19:57:35.0544559Z 2025-05-07T19:57:35.0545001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.0545678Z 2025-05-07T19:57:35.0547338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0549894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0551015Z ^ 2025-05-07T19:57:35.0551383Z 2025-05-07T19:57:35.0553012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0555630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0556773Z ^ 2025-05-07T19:57:35.0557020Z 2025-05-07T19:57:35.0557478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.0558120Z 2025-05-07T19:57:35.0559734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.0562379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.0563541Z ^ 2025-05-07T19:57:35.0563888Z 2025-05-07T19:57:35.7093606Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:35.7116755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7119451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7120614Z ^ 2025-05-07T19:57:35.7120905Z 2025-05-07T19:57:35.7121354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.7122014Z 2025-05-07T19:57:35.7123665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7126336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7127527Z ^ 2025-05-07T19:57:35.7127899Z 2025-05-07T19:57:35.7129516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7132142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7133288Z ^ 2025-05-07T19:57:35.7133613Z 2025-05-07T19:57:35.7134062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.7134723Z 2025-05-07T19:57:35.7136466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7139117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7140285Z ^ 2025-05-07T19:57:35.7140644Z 2025-05-07T19:57:35.7142022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7143769Z int error_code = 0; 2025-05-07T19:57:35.7144199Z ^ 2025-05-07T19:57:35.7144404Z 2025-05-07T19:57:35.7145790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7147722Z int64_t error_value; 2025-05-07T19:57:35.7148144Z ^ 2025-05-07T19:57:35.7148380Z 2025-05-07T19:57:35.7149735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7151479Z int error_code = 0; 2025-05-07T19:57:35.7152038Z ^ 2025-05-07T19:57:35.7152240Z 2025-05-07T19:57:35.7153627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7155354Z int64_t error_value; 2025-05-07T19:57:35.7155789Z ^ 2025-05-07T19:57:35.7156014Z 2025-05-07T19:57:35.7157379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7159111Z int error_code = 0; 
2025-05-07T19:57:35.7159536Z ^ 2025-05-07T19:57:35.7159733Z 2025-05-07T19:57:35.7161117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7162870Z int64_t error_value; 2025-05-07T19:57:35.7163288Z ^ 2025-05-07T19:57:35.7163517Z 2025-05-07T19:57:35.7164882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7166607Z int error_code = 0; 2025-05-07T19:57:35.7167023Z ^ 2025-05-07T19:57:35.7167232Z 2025-05-07T19:57:35.7168600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7170604Z int64_t error_value; 2025-05-07T19:57:35.7171037Z ^ 2025-05-07T19:57:35.7171257Z 2025-05-07T19:57:35.7172894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7175507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7176728Z ^ 2025-05-07T19:57:35.7176974Z 2025-05-07T19:57:35.7177424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.7178088Z 2025-05-07T19:57:35.7179731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7182359Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7183508Z ^ 2025-05-07T19:57:35.7183877Z 2025-05-07T19:57:35.7185256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7186990Z int error_code = 0; 2025-05-07T19:57:35.7187405Z ^ 2025-05-07T19:57:35.7187617Z 2025-05-07T19:57:35.7188995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7191890Z int64_t error_value; 2025-05-07T19:57:35.7192328Z ^ 2025-05-07T19:57:35.7192547Z 2025-05-07T19:57:35.7193915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7195640Z int error_code = 0; 2025-05-07T19:57:35.7196061Z ^ 2025-05-07T19:57:35.7196416Z 2025-05-07T19:57:35.7197793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7199552Z int64_t error_value; 2025-05-07T19:57:35.7199988Z ^ 2025-05-07T19:57:35.7200210Z 2025-05-07T19:57:35.7201567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7203315Z int error_code = 0; 2025-05-07T19:57:35.7203733Z ^ 2025-05-07T19:57:35.7203951Z 2025-05-07T19:57:35.7205312Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7207058Z int64_t error_value; 2025-05-07T19:57:35.7207475Z ^ 2025-05-07T19:57:35.7207694Z 2025-05-07T19:57:35.7209074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7210792Z int error_code = 0; 2025-05-07T19:57:35.7211224Z ^ 2025-05-07T19:57:35.7211418Z 2025-05-07T19:57:35.7212806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7214536Z int64_t error_value; 2025-05-07T19:57:35.7214970Z ^ 2025-05-07T19:57:35.7215192Z 2025-05-07T19:57:35.7216953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7219578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7220711Z ^ 2025-05-07T19:57:35.7220955Z 2025-05-07T19:57:35.7221395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.7222072Z 2025-05-07T19:57:35.7223705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7226337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7227502Z ^ 2025-05-07T19:57:35.7227858Z 2025-05-07T19:57:35.7229250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7230962Z int error_code = 0; 2025-05-07T19:57:35.7231388Z ^ 2025-05-07T19:57:35.7231588Z 2025-05-07T19:57:35.7232983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7234843Z int64_t error_value; 2025-05-07T19:57:35.7235279Z ^ 2025-05-07T19:57:35.7235501Z 2025-05-07T19:57:35.7236868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7238598Z int error_code = 0; 2025-05-07T19:57:35.7239015Z ^ 2025-05-07T19:57:35.7239319Z 2025-05-07T19:57:35.7240698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7242437Z int64_t error_value; 2025-05-07T19:57:35.7242859Z ^ 2025-05-07T19:57:35.7243091Z 2025-05-07T19:57:35.7244454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7246185Z int error_code = 0; 2025-05-07T19:57:35.7246594Z ^ 2025-05-07T19:57:35.7246796Z 2025-05-07T19:57:35.7248175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: 
variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7249916Z int64_t error_value; 2025-05-07T19:57:35.7250350Z ^ 2025-05-07T19:57:35.7250568Z 2025-05-07T19:57:35.7251940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7253660Z int error_code = 0; 2025-05-07T19:57:35.7254092Z ^ 2025-05-07T19:57:35.7254292Z 2025-05-07T19:57:35.7255664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7257538Z int64_t error_value; 2025-05-07T19:57:35.7257957Z ^ 2025-05-07T19:57:35.7258195Z 2025-05-07T19:57:35.7259812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7262454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7263591Z ^ 2025-05-07T19:57:35.7263849Z 2025-05-07T19:57:35.7264292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.7264952Z 2025-05-07T19:57:35.7266625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.7269262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.7270641Z ^ 2025-05-07T19:57:35.7271001Z 2025-05-07T19:57:35.7272412Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7274155Z int error_code = 0; 2025-05-07T19:57:35.7274587Z ^ 2025-05-07T19:57:35.7274791Z 2025-05-07T19:57:35.7276186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7278226Z int64_t error_value; 2025-05-07T19:57:35.7278654Z ^ 2025-05-07T19:57:35.7278895Z 2025-05-07T19:57:35.7280284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7282046Z int error_code = 0; 2025-05-07T19:57:35.7282464Z ^ 2025-05-07T19:57:35.7282679Z 2025-05-07T19:57:35.7284203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7285973Z int64_t error_value; 2025-05-07T19:57:35.7286397Z ^ 2025-05-07T19:57:35.7286619Z 2025-05-07T19:57:35.7288022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7289766Z int error_code = 0; 2025-05-07T19:57:35.7290189Z ^ 2025-05-07T19:57:35.7290392Z 2025-05-07T19:57:35.7291786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7293562Z int64_t error_value; 
2025-05-07T19:57:35.7294008Z ^ 2025-05-07T19:57:35.7294232Z 2025-05-07T19:57:35.7295624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:35.7297459Z int error_code = 0; 2025-05-07T19:57:35.7297863Z ^ 2025-05-07T19:57:35.7298082Z 2025-05-07T19:57:35.7299451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:35.7301183Z int64_t error_value; 2025-05-07T19:57:35.7301604Z ^ 2025-05-07T19:57:35.7301833Z 2025-05-07T19:57:37.5305253Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:37.5326746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5329329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5330405Z ^ 2025-05-07T19:57:37.5330726Z 2025-05-07T19:57:37.5331125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.5331723Z 2025-05-07T19:57:37.5333251Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5335744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5337001Z ^ 2025-05-07T19:57:37.5337397Z 2025-05-07T19:57:37.5338998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5341551Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5342641Z ^ 2025-05-07T19:57:37.5342892Z 2025-05-07T19:57:37.5343327Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.5343923Z 2025-05-07T19:57:37.5345431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5347905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5349099Z ^ 2025-05-07T19:57:37.5349454Z 2025-05-07T19:57:37.5350724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:37.5352335Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:37.5352860Z ^ 2025-05-07T19:57:37.5353116Z 2025-05-07T19:57:37.5354737Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5357186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5358582Z ^ 2025-05-07T19:57:37.5358847Z 2025-05-07T19:57:37.5359286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.5359886Z 2025-05-07T19:57:37.5361276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5363888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5365019Z ^ 2025-05-07T19:57:37.5365377Z 2025-05-07T19:57:37.5366645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:37.5368351Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:37.5368884Z ^ 2025-05-07T19:57:37.5369127Z 2025-05-07T19:57:37.5370799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5373187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5374327Z ^ 2025-05-07T19:57:37.5374578Z 2025-05-07T19:57:37.5374998Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:37.5375586Z 2025-05-07T19:57:37.5377091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5379482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5380509Z ^ 2025-05-07T19:57:37.5380821Z 2025-05-07T19:57:37.5381968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:37.5383555Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:37.5384083Z ^ 2025-05-07T19:57:37.5384339Z 2025-05-07T19:57:37.5385969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5388603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5389736Z ^ 2025-05-07T19:57:37.5389960Z 2025-05-07T19:57:37.5390383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.5391036Z 2025-05-07T19:57:37.5392583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.5394981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.5396027Z ^ 2025-05-07T19:57:37.5396371Z 2025-05-07T19:57:37.5397616Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:37.5399656Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:37.5400185Z ^ 2025-05-07T19:57:37.5400436Z 2025-05-07T19:57:37.7934195Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:37.7945964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7947332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7947963Z ^ 2025-05-07T19:57:37.7948112Z 2025-05-07T19:57:37.7948354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.7948715Z 2025-05-07T19:57:37.7949554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7950898Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7951503Z ^ 2025-05-07T19:57:37.7951696Z 2025-05-07T19:57:37.7952523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7954001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7954606Z ^ 2025-05-07T19:57:37.7954742Z 2025-05-07T19:57:37.7954988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.7955329Z 2025-05-07T19:57:37.7956267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7957615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7958233Z ^ 2025-05-07T19:57:37.7958429Z 2025-05-07T19:57:37.7959247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7960576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7961181Z ^ 2025-05-07T19:57:37.7961315Z 2025-05-07T19:57:37.7961552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.7961902Z 2025-05-07T19:57:37.7962732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7964073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7964685Z ^ 2025-05-07T19:57:37.7964879Z 2025-05-07T19:57:37.7965712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7967033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7967644Z ^ 2025-05-07T19:57:37.7967782Z 2025-05-07T19:57:37.7968022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.7968357Z 2025-05-07T19:57:37.7969186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7970793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7971405Z ^ 2025-05-07T19:57:37.7971595Z 2025-05-07T19:57:37.7972407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7973751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7974335Z ^ 2025-05-07T19:57:37.7974489Z 2025-05-07T19:57:37.7974725Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:37.7975071Z 2025-05-07T19:57:37.7975925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.7977488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.7978132Z ^ 2025-05-07T19:57:37.7978328Z 2025-05-07T19:57:38.0857235Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:38.0879653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0882343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0883483Z ^ 2025-05-07T19:57:38.0883743Z 2025-05-07T19:57:38.0884221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:38.0884868Z 2025-05-07T19:57:38.0886464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0889178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0890508Z ^ 2025-05-07T19:57:38.0890886Z 2025-05-07T19:57:38.0892475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0895241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0896501Z ^ 2025-05-07T19:57:38.0896786Z 2025-05-07T19:57:38.0897232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:38.0897850Z 2025-05-07T19:57:38.0899438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0902044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0903224Z ^ 2025-05-07T19:57:38.0903584Z 2025-05-07T19:57:38.0905176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0907765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0908908Z ^ 2025-05-07T19:57:38.0909164Z 2025-05-07T19:57:38.0909599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:38.0910277Z 2025-05-07T19:57:38.0911904Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0914508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0915635Z ^ 2025-05-07T19:57:38.0916032Z 2025-05-07T19:57:38.0917609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0920234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0921353Z ^ 2025-05-07T19:57:38.0921623Z 2025-05-07T19:57:38.0922088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:38.0922726Z 2025-05-07T19:57:38.0924347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0926855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0927944Z ^ 2025-05-07T19:57:38.0928296Z 2025-05-07T19:57:38.0929877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0932469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0933866Z ^ 
2025-05-07T19:57:38.0934125Z 2025-05-07T19:57:38.0934568Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:38.0935223Z 2025-05-07T19:57:38.0936940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.0939668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:38.0940796Z ^ 2025-05-07T19:57:38.0941126Z 2025-05-07T19:57:40.5763369Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:40.5785119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5787463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5788515Z ^ 2025-05-07T19:57:40.5788749Z 2025-05-07T19:57:40.5789136Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:40.5790130Z 2025-05-07T19:57:40.5791631Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5794223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5795370Z ^ 2025-05-07T19:57:40.5795758Z 2025-05-07T19:57:40.5797402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5799792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5800835Z ^ 2025-05-07T19:57:40.5801090Z 2025-05-07T19:57:40.5801484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:40.5802097Z 2025-05-07T19:57:40.5803501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5805926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5807084Z ^ 2025-05-07T19:57:40.5807443Z 2025-05-07T19:57:40.5809103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5811584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5812675Z ^ 
2025-05-07T19:57:40.5812900Z 2025-05-07T19:57:40.5813321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:40.5813873Z 2025-05-07T19:57:40.5815319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5817874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5819006Z ^ 2025-05-07T19:57:40.5819359Z 2025-05-07T19:57:40.5820916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5823484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5824589Z ^ 2025-05-07T19:57:40.5824855Z 2025-05-07T19:57:40.5825283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:40.5825921Z 2025-05-07T19:57:40.5827538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5830140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5831296Z ^ 2025-05-07T19:57:40.5831644Z 2025-05-07T19:57:40.5833348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:40.5836111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5837275Z ^ 2025-05-07T19:57:40.5837527Z 2025-05-07T19:57:40.5837968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:40.5838625Z 2025-05-07T19:57:40.5840434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.5842940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:40.5844019Z ^ 2025-05-07T19:57:40.5844358Z 2025-05-07T19:57:41.8790800Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:57:41.8812314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8815251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8816680Z ^ 
2025-05-07T19:57:41.8816913Z 2025-05-07T19:57:41.8817309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.8817930Z 2025-05-07T19:57:41.8819316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8821870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8822981Z ^ 2025-05-07T19:57:41.8823299Z 2025-05-07T19:57:41.8824743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8827167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8828200Z ^ 2025-05-07T19:57:41.8828454Z 2025-05-07T19:57:41.8828889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.8829503Z 2025-05-07T19:57:41.8830970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8833392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8834427Z ^ 2025-05-07T19:57:41.8834763Z 2025-05-07T19:57:41.8836178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:41.8838584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8839672Z ^ 2025-05-07T19:57:41.8839881Z 2025-05-07T19:57:41.8840292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.8840901Z 2025-05-07T19:57:41.8842414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8844864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8846022Z ^ 2025-05-07T19:57:41.8846351Z 2025-05-07T19:57:41.8847730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8850147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8851160Z ^ 2025-05-07T19:57:41.8851398Z 2025-05-07T19:57:41.8851814Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.8852393Z 2025-05-07T19:57:41.8853852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8856748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8857854Z ^ 2025-05-07T19:57:41.8858196Z 2025-05-07T19:57:41.8859635Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8862164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8863236Z ^ 2025-05-07T19:57:41.8863469Z 2025-05-07T19:57:41.8863876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.8864472Z 2025-05-07T19:57:41.8865938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.8868251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.8869313Z ^ 2025-05-07T19:57:41.8869635Z 2025-05-07T19:57:43.8182284Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:57:43.8203851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:43.8206290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8207346Z ^ 2025-05-07T19:57:43.8207678Z 2025-05-07T19:57:43.8208488Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.8209096Z 2025-05-07T19:57:43.8210589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8212972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8214143Z ^ 2025-05-07T19:57:43.8214491Z 2025-05-07T19:57:43.8215938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8218532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8219671Z ^ 2025-05-07T19:57:43.8219896Z 2025-05-07T19:57:43.8220281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.8220902Z 2025-05-07T19:57:43.8222395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8224913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8226054Z ^ 2025-05-07T19:57:43.8226425Z 2025-05-07T19:57:43.8227858Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8230135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8231115Z ^ 2025-05-07T19:57:43.8231347Z 2025-05-07T19:57:43.8231735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.8232367Z 2025-05-07T19:57:43.8233891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8236296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8237292Z ^ 2025-05-07T19:57:43.8237625Z 2025-05-07T19:57:43.8239012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8241396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8242411Z ^ 2025-05-07T19:57:43.8242867Z 2025-05-07T19:57:43.8243353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.8243978Z 2025-05-07T19:57:43.8245382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8247795Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8249125Z ^ 2025-05-07T19:57:43.8249475Z 2025-05-07T19:57:43.8250902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8253340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8254350Z ^ 2025-05-07T19:57:43.8254583Z 2025-05-07T19:57:43.8255014Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.8255637Z 2025-05-07T19:57:43.8257240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.8259517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.8260588Z ^ 2025-05-07T19:57:43.8260948Z 2025-05-07T19:57:46.4568888Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:46.4592804Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4595440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4596503Z ^ 2025-05-07T19:57:46.4596752Z 2025-05-07T19:57:46.4597215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4597885Z 2025-05-07T19:57:46.4599490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4601929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4603016Z ^ 2025-05-07T19:57:46.4603358Z 2025-05-07T19:57:46.4604853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4607350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4608506Z ^ 2025-05-07T19:57:46.4608754Z 2025-05-07T19:57:46.4609179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4609827Z 2025-05-07T19:57:46.4611360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4613736Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4614721Z ^ 2025-05-07T19:57:46.4615040Z 2025-05-07T19:57:46.4616664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4619007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4620077Z ^ 2025-05-07T19:57:46.4620302Z 2025-05-07T19:57:46.4620728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4621337Z 2025-05-07T19:57:46.4622719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4624702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4625625Z ^ 2025-05-07T19:57:46.4625888Z 2025-05-07T19:57:46.4627473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4630392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4631392Z ^ 2025-05-07T19:57:46.4631648Z 2025-05-07T19:57:46.4632061Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4632652Z 2025-05-07T19:57:46.4634328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4636499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4637702Z ^ 2025-05-07T19:57:46.4638040Z 2025-05-07T19:57:46.4639662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4642235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4643264Z ^ 2025-05-07T19:57:46.4643493Z 2025-05-07T19:57:46.4643872Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4644475Z 2025-05-07T19:57:46.4645821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4648388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4649574Z ^ 2025-05-07T19:57:46.4649938Z 2025-05-07T19:58:01.3866190Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:01.3888681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3891237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3892386Z ^ 2025-05-07T19:58:01.3892644Z 2025-05-07T19:58:01.3893051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:01.3893631Z 2025-05-07T19:58:01.3895242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3897907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3898986Z ^ 2025-05-07T19:58:01.3899368Z 2025-05-07T19:58:01.3900959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3903273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3904345Z ^ 2025-05-07T19:58:01.3904576Z 2025-05-07T19:58:01.3905007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:01.3905648Z 2025-05-07T19:58:01.3907240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3909820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3910999Z ^ 2025-05-07T19:58:01.3911362Z 2025-05-07T19:58:01.3913009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3915466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3916428Z ^ 2025-05-07T19:58:01.3916639Z 2025-05-07T19:58:01.3917009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:01.3917579Z 2025-05-07T19:58:01.3919085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3921672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3923217Z ^ 2025-05-07T19:58:01.3923585Z 2025-05-07T19:58:01.3925223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3927855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3929009Z ^ 2025-05-07T19:58:01.3929458Z 2025-05-07T19:58:01.3929920Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:01.3930577Z 2025-05-07T19:58:01.3932213Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3934742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3935822Z ^ 2025-05-07T19:58:01.3936166Z 2025-05-07T19:58:01.3937832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3940334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3941480Z ^ 2025-05-07T19:58:01.3941745Z 2025-05-07T19:58:01.3942190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:01.3942794Z 2025-05-07T19:58:01.3944350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:01.3947000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:01.3948161Z ^ 2025-05-07T19:58:01.3948547Z 2025-05-07T19:58:08.2436660Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:08.2459602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2462214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2463358Z ^ 2025-05-07T19:58:08.2463611Z 2025-05-07T19:58:08.2464079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:08.2464744Z 2025-05-07T19:58:08.2466318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2468570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2469618Z ^ 2025-05-07T19:58:08.2469936Z 2025-05-07T19:58:08.2471827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2474501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2475652Z ^ 2025-05-07T19:58:08.2475904Z 
2025-05-07T19:58:08.2476344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:08.2477018Z 2025-05-07T19:58:08.2478663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2481328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2482474Z ^ 2025-05-07T19:58:08.2482778Z 2025-05-07T19:58:08.2484189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2486537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2487582Z ^ 2025-05-07T19:58:08.2487803Z 2025-05-07T19:58:08.2488145Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:08.2488795Z 2025-05-07T19:58:08.2490762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2493330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2494451Z ^ 2025-05-07T19:58:08.2494819Z 2025-05-07T19:58:08.2496715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2499307Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2500330Z ^ 2025-05-07T19:58:08.2500553Z 2025-05-07T19:58:08.2500968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:08.2501574Z 2025-05-07T19:58:08.2503044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2505466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2506493Z ^ 2025-05-07T19:58:08.2506810Z 2025-05-07T19:58:08.2508299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2510667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2511781Z ^ 2025-05-07T19:58:08.2512015Z 2025-05-07T19:58:08.2512453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:08.2513082Z 2025-05-07T19:58:08.2514600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:08.2517149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:08.2518289Z ^ 2025-05-07T19:58:08.2518634Z 2025-05-07T19:58:11.8339997Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:11.8360612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8363008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8364075Z ^ 2025-05-07T19:58:11.8364326Z 2025-05-07T19:58:11.8364699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.8365278Z 2025-05-07T19:58:11.8366713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8369123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8370474Z ^ 2025-05-07T19:58:11.8370827Z 2025-05-07T19:58:11.8372237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8374621Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8375675Z ^ 2025-05-07T19:58:11.8375918Z 2025-05-07T19:58:11.8376427Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.8376960Z 2025-05-07T19:58:11.8378142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8380459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8381508Z ^ 2025-05-07T19:58:11.8381875Z 2025-05-07T19:58:11.8383269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8385704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8387152Z ^ 2025-05-07T19:58:11.8387379Z 2025-05-07T19:58:11.8387763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.8388315Z 2025-05-07T19:58:11.8389781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8392395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8393444Z ^ 2025-05-07T19:58:11.8393774Z 2025-05-07T19:58:11.8395250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8397576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8398448Z ^ 2025-05-07T19:58:11.8398667Z 2025-05-07T19:58:11.8399004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.8399482Z 2025-05-07T19:58:11.8400596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8402741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8403684Z ^ 2025-05-07T19:58:11.8403968Z 2025-05-07T19:58:11.8405310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8407600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.8408613Z ^ 2025-05-07T19:58:11.8408835Z 2025-05-07T19:58:11.8409233Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.8409824Z 2025-05-07T19:58:11.8411275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.8413655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:11.8414689Z ^ 2025-05-07T19:58:11.8415038Z 2025-05-07T19:58:13.1386071Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:13.1406257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1408573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1409655Z ^ 2025-05-07T19:58:13.1409887Z 2025-05-07T19:58:13.1410304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.1410927Z 2025-05-07T19:58:13.1412461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1414904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1415966Z ^ 2025-05-07T19:58:13.1416434Z 2025-05-07T19:58:13.1417877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:13.1420105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1421038Z ^ 2025-05-07T19:58:13.1421275Z 2025-05-07T19:58:13.1421660Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.1422232Z 2025-05-07T19:58:13.1423691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1426011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1427077Z ^ 2025-05-07T19:58:13.1427390Z 2025-05-07T19:58:13.1429012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1431205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1432268Z ^ 2025-05-07T19:58:13.1432491Z 2025-05-07T19:58:13.1432891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.1433712Z 2025-05-07T19:58:13.1435176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1437551Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1438635Z ^ 2025-05-07T19:58:13.1438987Z 2025-05-07T19:58:13.1440450Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1442833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1443892Z ^ 2025-05-07T19:58:13.1444157Z 2025-05-07T19:58:13.1444570Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.1445193Z 2025-05-07T19:58:13.1446634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1449018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1450091Z ^ 2025-05-07T19:58:13.1450430Z 2025-05-07T19:58:13.1451945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1454304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1455400Z ^ 2025-05-07T19:58:13.1455638Z 2025-05-07T19:58:13.1456030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.1456872Z 2025-05-07T19:58:13.1458336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.1460712Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.1461788Z ^ 2025-05-07T19:58:13.1462145Z 2025-05-07T19:58:13.9818292Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:13.9839283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9841795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9842894Z ^ 2025-05-07T19:58:13.9843144Z 2025-05-07T19:58:13.9843583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.9844215Z 2025-05-07T19:58:13.9845614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9848093Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9849138Z ^ 2025-05-07T19:58:13.9849454Z 2025-05-07T19:58:13.9850849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9853134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9854178Z ^ 2025-05-07T19:58:13.9854435Z 2025-05-07T19:58:13.9854815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.9855411Z 2025-05-07T19:58:13.9857087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9859752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9860788Z ^ 2025-05-07T19:58:13.9861115Z 2025-05-07T19:58:13.9862572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9864955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9866055Z ^ 2025-05-07T19:58:13.9866299Z 2025-05-07T19:58:13.9866723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.9867279Z 2025-05-07T19:58:13.9868648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9871400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9872454Z ^ 2025-05-07T19:58:13.9872799Z 2025-05-07T19:58:13.9874192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9876429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9877430Z ^ 2025-05-07T19:58:13.9877664Z 2025-05-07T19:58:13.9878058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:13.9878645Z 2025-05-07T19:58:13.9880108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9882432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9883531Z ^ 2025-05-07T19:58:13.9883860Z 2025-05-07T19:58:13.9885327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9887546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9888617Z ^ 2025-05-07T19:58:13.9888829Z 2025-05-07T19:58:13.9889218Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:13.9889811Z 2025-05-07T19:58:13.9891053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:13.9893493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:13.9894574Z ^ 2025-05-07T19:58:13.9894916Z 2025-05-07T19:58:14.0390610Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:14.0412007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0414565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0415628Z ^ 2025-05-07T19:58:14.0415875Z 2025-05-07T19:58:14.0416456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.0417085Z 2025-05-07T19:58:14.0418541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:14.0420790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0421803Z ^ 2025-05-07T19:58:14.0422134Z 2025-05-07T19:58:14.0423651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0425942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0427036Z ^ 2025-05-07T19:58:14.0427280Z 2025-05-07T19:58:14.0427709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.0428566Z 2025-05-07T19:58:14.0429968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0432369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0433435Z ^ 2025-05-07T19:58:14.0433771Z 2025-05-07T19:58:14.0435492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0437929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0439016Z ^ 2025-05-07T19:58:14.0439266Z 2025-05-07T19:58:14.0439683Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.0440286Z 2025-05-07T19:58:14.0441776Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0444211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0445296Z ^ 2025-05-07T19:58:14.0445631Z 2025-05-07T19:58:14.0447170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0449579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0450683Z ^ 2025-05-07T19:58:14.0450942Z 2025-05-07T19:58:14.0451390Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.0451965Z 2025-05-07T19:58:14.0453463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0455760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0457077Z ^ 2025-05-07T19:58:14.0457416Z 2025-05-07T19:58:14.0458834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0461406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0462439Z ^ 
2025-05-07T19:58:14.0462682Z 2025-05-07T19:58:14.0463101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.0463732Z 2025-05-07T19:58:14.0465359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.0468000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.0469114Z ^ 2025-05-07T19:58:14.0471339Z 2025-05-07T19:58:15.1274984Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:15.1297626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1300063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1301209Z ^ 2025-05-07T19:58:15.1301444Z 2025-05-07T19:58:15.1301827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.1302437Z 2025-05-07T19:58:15.1303903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1306414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1307580Z ^ 2025-05-07T19:58:15.1307979Z 2025-05-07T19:58:15.1309568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1312583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1313727Z ^ 2025-05-07T19:58:15.1314012Z 2025-05-07T19:58:15.1314455Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.1315110Z 2025-05-07T19:58:15.1316946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1319646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1320753Z ^ 2025-05-07T19:58:15.1321058Z 2025-05-07T19:58:15.1322430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1324852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1326000Z ^ 2025-05-07T19:58:15.1326243Z 2025-05-07T19:58:15.1326660Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:15.1327344Z 2025-05-07T19:58:15.1328898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1331405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1332488Z ^ 2025-05-07T19:58:15.1332862Z 2025-05-07T19:58:15.1334317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1336936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1337976Z ^ 2025-05-07T19:58:15.1338227Z 2025-05-07T19:58:15.1338715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.1339362Z 2025-05-07T19:58:15.1340997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1343638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1344826Z ^ 2025-05-07T19:58:15.1345180Z 2025-05-07T19:58:15.1346778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1349443Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1350654Z ^ 2025-05-07T19:58:15.1350907Z 2025-05-07T19:58:15.1351339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.1351958Z 2025-05-07T19:58:15.1353443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.1356111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.1357303Z ^ 2025-05-07T19:58:15.1357675Z 2025-05-07T19:58:15.8577585Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:15.8599175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8601668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8602827Z ^ 2025-05-07T19:58:15.8603113Z 2025-05-07T19:58:15.8603574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.8604246Z 
2025-05-07T19:58:15.8605950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8608467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8609561Z ^ 2025-05-07T19:58:15.8610286Z 2025-05-07T19:58:15.8611848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8614431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8615592Z ^ 2025-05-07T19:58:15.8615846Z 2025-05-07T19:58:15.8616588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.8617265Z 2025-05-07T19:58:15.8618878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8621469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8622629Z ^ 2025-05-07T19:58:15.8623010Z 2025-05-07T19:58:15.8624577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8627114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:15.8628271Z ^ 2025-05-07T19:58:15.8628549Z 2025-05-07T19:58:15.8628987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.8629622Z 2025-05-07T19:58:15.8631226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8633848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8634956Z ^ 2025-05-07T19:58:15.8635299Z 2025-05-07T19:58:15.8636909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8639524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8640717Z ^ 2025-05-07T19:58:15.8640973Z 2025-05-07T19:58:15.8641364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.8642020Z 2025-05-07T19:58:15.8643594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8646277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8647445Z ^ 2025-05-07T19:58:15.8647817Z 2025-05-07T19:58:15.8649447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:15.8651912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8652980Z ^ 2025-05-07T19:58:15.8653197Z 2025-05-07T19:58:15.8653618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.8654500Z 2025-05-07T19:58:15.8655966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.8658493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.8659678Z ^ 2025-05-07T19:58:15.8660019Z 2025-05-07T19:58:15.9687314Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:15.9711210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9713901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9715069Z ^ 
2025-05-07T19:58:15.9715321Z 2025-05-07T19:58:15.9715769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.9716436Z 2025-05-07T19:58:15.9718123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9721100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9722302Z ^ 2025-05-07T19:58:15.9722667Z 2025-05-07T19:58:15.9724452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9727118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9728299Z ^ 2025-05-07T19:58:15.9728551Z 2025-05-07T19:58:15.9729011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.9729682Z 2025-05-07T19:58:15.9731342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9733980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9734981Z ^ 2025-05-07T19:58:15.9735302Z 2025-05-07T19:58:15.9737078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:15.9739731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9740863Z ^ 2025-05-07T19:58:15.9741114Z 2025-05-07T19:58:15.9741512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.9742134Z 2025-05-07T19:58:15.9743752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9746229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9747427Z ^ 2025-05-07T19:58:15.9747791Z 2025-05-07T19:58:15.9749473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9752082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9753260Z ^ 2025-05-07T19:58:15.9753511Z 2025-05-07T19:58:15.9753950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.9754625Z 2025-05-07T19:58:15.9756273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9758900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9760047Z ^ 2025-05-07T19:58:15.9760404Z 2025-05-07T19:58:15.9761905Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9764768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9765894Z ^ 2025-05-07T19:58:15.9766154Z 2025-05-07T19:58:15.9766578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:15.9767153Z 2025-05-07T19:58:15.9768865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:15.9771732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:15.9772797Z ^ 2025-05-07T19:58:15.9773135Z 2025-05-07T19:58:16.4198504Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:16.4221165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4223855Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4225039Z ^ 2025-05-07T19:58:16.4225616Z 2025-05-07T19:58:16.4226092Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:16.4226777Z 2025-05-07T19:58:16.4228469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4231152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4232431Z ^ 2025-05-07T19:58:16.4232804Z 2025-05-07T19:58:16.4234475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4237021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4238164Z ^ 2025-05-07T19:58:16.4238423Z 2025-05-07T19:58:16.4238853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:16.4239487Z 2025-05-07T19:58:16.4241126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4243658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4244737Z ^ 2025-05-07T19:58:16.4245086Z 2025-05-07T19:58:16.4246703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4249215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4250352Z ^ 2025-05-07T19:58:16.4250599Z 2025-05-07T19:58:16.4251042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:16.4251686Z 2025-05-07T19:58:16.4253317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4256022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4257293Z ^ 2025-05-07T19:58:16.4257659Z 2025-05-07T19:58:16.4259254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4261823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4262933Z ^ 2025-05-07T19:58:16.4263190Z 2025-05-07T19:58:16.4263559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:16.4264190Z 2025-05-07T19:58:16.4265822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4268322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:16.4269668Z ^ 2025-05-07T19:58:16.4270025Z 2025-05-07T19:58:16.4271845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4274466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4275614Z ^ 2025-05-07T19:58:16.4275856Z 2025-05-07T19:58:16.4276473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:16.4277128Z 2025-05-07T19:58:16.4278748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:16.4281273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:16.4282423Z ^ 2025-05-07T19:58:16.4282794Z 2025-05-07T19:58:18.6113721Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:18.6135612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:18.6138736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6139890Z ^ 2025-05-07T19:58:18.6140137Z 2025-05-07T19:58:18.6140569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.6141215Z 2025-05-07T19:58:18.6143030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6145632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6146722Z ^ 2025-05-07T19:58:18.6147083Z 2025-05-07T19:58:18.6148580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6151110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6152099Z ^ 2025-05-07T19:58:18.6152338Z 2025-05-07T19:58:18.6152782Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.6153430Z 2025-05-07T19:58:18.6155037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6157675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6158742Z ^ 2025-05-07T19:58:18.6159097Z 2025-05-07T19:58:18.6160424Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:18.6162014Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:18.6162475Z ^ 2025-05-07T19:58:18.6162723Z 2025-05-07T19:58:18.6164245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6166717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6167839Z ^ 2025-05-07T19:58:18.6168087Z 2025-05-07T19:58:18.6168547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.6169210Z 2025-05-07T19:58:18.6171322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6173872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6174945Z ^ 2025-05-07T19:58:18.6175263Z 2025-05-07T19:58:18.6176617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:18.6178336Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:18.6178865Z ^ 2025-05-07T19:58:18.6179121Z 2025-05-07T19:58:18.6181051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:18.6183668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6184808Z ^ 2025-05-07T19:58:18.6185074Z 2025-05-07T19:58:18.6185526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.6186351Z 2025-05-07T19:58:18.6187877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6190444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6191452Z ^ 2025-05-07T19:58:18.6191794Z 2025-05-07T19:58:18.6193029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:18.6194634Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:18.6195110Z ^ 2025-05-07T19:58:18.6195332Z 2025-05-07T19:58:18.6196721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6199127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6200154Z ^ 2025-05-07T19:58:18.6200413Z 2025-05-07T19:58:18.6200829Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.6201502Z 2025-05-07T19:58:18.6203118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.6205453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.6206521Z ^ 2025-05-07T19:58:18.6206848Z 2025-05-07T19:58:18.6208093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:18.6209764Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:18.6210320Z ^ 2025-05-07T19:58:18.6210585Z 2025-05-07T19:58:19.5149292Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:19.5171706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5174061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5175188Z ^ 2025-05-07T19:58:19.5175436Z 2025-05-07T19:58:19.5175835Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:58:19.5176540Z 2025-05-07T19:58:19.5178118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5180768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5181932Z ^ 2025-05-07T19:58:19.5182296Z 2025-05-07T19:58:19.5183785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5186146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5187125Z ^ 2025-05-07T19:58:19.5187373Z 2025-05-07T19:58:19.5187796Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5188421Z 2025-05-07T19:58:19.5190055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5192588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5193754Z ^ 2025-05-07T19:58:19.5194116Z 2025-05-07T19:58:19.5195662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5198447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5199551Z ^ 2025-05-07T19:58:19.5199780Z 2025-05-07T19:58:19.5200185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5200797Z 2025-05-07T19:58:19.5202660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5205179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5206302Z ^ 2025-05-07T19:58:19.5206662Z 2025-05-07T19:58:19.5208216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5210539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5211655Z ^ 2025-05-07T19:58:19.5211907Z 2025-05-07T19:58:19.5212335Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5212916Z 2025-05-07T19:58:19.5214234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5216646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5217693Z ^ 2025-05-07T19:58:19.5218014Z 2025-05-07T19:58:19.5219544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5222078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5223166Z ^ 2025-05-07T19:58:19.5223420Z 2025-05-07T19:58:19.5223789Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5224361Z 2025-05-07T19:58:19.5225780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.5228218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:19.5229357Z ^ 2025-05-07T19:58:19.5229713Z 2025-05-07T19:58:23.8255712Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:58:23.8276706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8279204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:23.8280233Z ^ 2025-05-07T19:58:23.8280480Z 2025-05-07T19:58:23.8280876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8281430Z 2025-05-07T19:58:23.8282978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8285541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8286675Z ^ 2025-05-07T19:58:23.8287067Z 2025-05-07T19:58:23.8288722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8291283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8292474Z ^ 2025-05-07T19:58:23.8292728Z 2025-05-07T19:58:23.8293205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8293744Z 2025-05-07T19:58:23.8295088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8297756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8299165Z ^ 2025-05-07T19:58:23.8299492Z 2025-05-07T19:58:23.8300991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:23.8303773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8304928Z ^ 2025-05-07T19:58:23.8305163Z 2025-05-07T19:58:23.8305486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8305959Z 2025-05-07T19:58:23.8307364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8309519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8310490Z ^ 2025-05-07T19:58:23.8310790Z 2025-05-07T19:58:23.8312107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8314258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8315203Z ^ 2025-05-07T19:58:23.8315411Z 2025-05-07T19:58:23.8315768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8316312Z 2025-05-07T19:58:23.8317630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8319831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8320779Z ^ 2025-05-07T19:58:23.8321097Z 2025-05-07T19:58:23.8322451Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8324612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8325541Z ^ 2025-05-07T19:58:23.8325762Z 2025-05-07T19:58:23.8326141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8326676Z 2025-05-07T19:58:23.8328034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8330051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8331046Z ^ 2025-05-07T19:58:23.8331334Z 2025-05-07T19:58:26.9301296Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:26.9321014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9323207Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9324217Z ^ 2025-05-07T19:58:26.9324429Z 2025-05-07T19:58:26.9324830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9325423Z 2025-05-07T19:58:26.9326870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9329097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9330076Z ^ 2025-05-07T19:58:26.9330370Z 2025-05-07T19:58:26.9331789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9333937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9334866Z ^ 2025-05-07T19:58:26.9335086Z 2025-05-07T19:58:26.9335440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9336007Z 2025-05-07T19:58:26.9337750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9339949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9340930Z ^ 2025-05-07T19:58:26.9341187Z 2025-05-07T19:58:26.9342650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9344972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9346043Z ^ 2025-05-07T19:58:26.9346293Z 2025-05-07T19:58:26.9346712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9347373Z 2025-05-07T19:58:26.9348784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9351246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9352377Z ^ 2025-05-07T19:58:26.9352720Z 2025-05-07T19:58:26.9354101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9356294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9357365Z ^ 2025-05-07T19:58:26.9357598Z 2025-05-07T19:58:26.9357966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9358515Z 2025-05-07T19:58:26.9359961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9362160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:26.9363187Z ^ 2025-05-07T19:58:26.9363490Z 2025-05-07T19:58:26.9364871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9367188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9368248Z ^ 2025-05-07T19:58:26.9368505Z 2025-05-07T19:58:26.9368911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9369503Z 2025-05-07T19:58:26.9371348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9373743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9374792Z ^ 2025-05-07T19:58:26.9375137Z 2025-05-07T19:58:27.0587552Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:27.0599265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0600617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0601229Z ^ 2025-05-07T19:58:27.0601377Z 2025-05-07T19:58:27.0601614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.0601959Z 2025-05-07T19:58:27.0602804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0604137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0604764Z ^ 2025-05-07T19:58:27.0604956Z 2025-05-07T19:58:27.0605784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0607103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0607808Z ^ 2025-05-07T19:58:27.0607944Z 2025-05-07T19:58:27.0608192Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.0608531Z 2025-05-07T19:58:27.0609365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0610770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0611370Z ^ 
2025-05-07T19:58:27.0611573Z 2025-05-07T19:58:27.0612385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0613713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0614303Z ^ 2025-05-07T19:58:27.0614453Z 2025-05-07T19:58:27.0614682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.0615019Z 2025-05-07T19:58:27.0615856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0617305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0617914Z ^ 2025-05-07T19:58:27.0618109Z 2025-05-07T19:58:27.0618942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0620274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0620908Z ^ 2025-05-07T19:58:27.0621055Z 2025-05-07T19:58:27.0621287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.0621621Z 2025-05-07T19:58:27.0622463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:27.0623788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0624403Z ^ 2025-05-07T19:58:27.0624597Z 2025-05-07T19:58:27.0625426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0626740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0627338Z ^ 2025-05-07T19:58:27.0627476Z 2025-05-07T19:58:27.0627717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.0628054Z 2025-05-07T19:58:27.0628880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.0630298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.0630892Z ^ 2025-05-07T19:58:27.0631098Z 2025-05-07T19:58:27.1016859Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:27.1037726Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1040277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1041318Z ^ 2025-05-07T19:58:27.1041553Z 2025-05-07T19:58:27.1041971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.1042600Z 2025-05-07T19:58:27.1044082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1046603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1047656Z ^ 2025-05-07T19:58:27.1048028Z 2025-05-07T19:58:27.1049792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1052304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1053345Z ^ 2025-05-07T19:58:27.1053575Z 2025-05-07T19:58:27.1053981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.1054559Z 2025-05-07T19:58:27.1056290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1058845Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1059941Z ^ 2025-05-07T19:58:27.1060296Z 2025-05-07T19:58:27.1061798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1064141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1065262Z ^ 2025-05-07T19:58:27.1065528Z 2025-05-07T19:58:27.1065950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.1066549Z 2025-05-07T19:58:27.1068017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1070817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1071893Z ^ 2025-05-07T19:58:27.1072248Z 2025-05-07T19:58:27.1073768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1076154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1077176Z ^ 2025-05-07T19:58:27.1077406Z 2025-05-07T19:58:27.1077836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.1078397Z 2025-05-07T19:58:27.1079780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1082201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1083323Z ^ 2025-05-07T19:58:27.1083675Z 2025-05-07T19:58:27.1085180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1087574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1088615Z ^ 2025-05-07T19:58:27.1088865Z 2025-05-07T19:58:27.1089243Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.1090174Z 2025-05-07T19:58:27.1091596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.1093953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.1095077Z ^ 2025-05-07T19:58:27.1095413Z 2025-05-07T19:58:27.4044706Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 
-o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:27.4067070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4069683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4071216Z ^ 2025-05-07T19:58:27.4071496Z 2025-05-07T19:58:27.4071910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.4072558Z 2025-05-07T19:58:27.4074120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4076987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4078127Z ^ 2025-05-07T19:58:27.4078443Z 2025-05-07T19:58:27.4080093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4082857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4084023Z ^ 2025-05-07T19:58:27.4084268Z 2025-05-07T19:58:27.4084653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.4085292Z 2025-05-07T19:58:27.4086908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4089600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4090723Z ^ 2025-05-07T19:58:27.4091112Z 2025-05-07T19:58:27.4092585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4094756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4095690Z ^ 2025-05-07T19:58:27.4095898Z 2025-05-07T19:58:27.4096458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.4097072Z 2025-05-07T19:58:27.4098634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4101313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4102538Z ^ 2025-05-07T19:58:27.4102916Z 2025-05-07T19:58:27.4104576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4107275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4108482Z ^ 2025-05-07T19:58:27.4108743Z 2025-05-07T19:58:27.4109197Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:27.4109892Z 2025-05-07T19:58:27.4111582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4114089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4115182Z ^ 2025-05-07T19:58:27.4115540Z 2025-05-07T19:58:27.4117067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4119741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4120875Z ^ 2025-05-07T19:58:27.4121113Z 2025-05-07T19:58:27.4121566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.4122187Z 2025-05-07T19:58:27.4123901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.4126457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.4127554Z ^ 2025-05-07T19:58:27.4127898Z 2025-05-07T19:58:29.7212472Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:29.7235496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7238021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7239152Z ^ 2025-05-07T19:58:29.7239414Z 2025-05-07T19:58:29.7240182Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.7240934Z 2025-05-07T19:58:29.7242439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7245007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7246142Z ^ 2025-05-07T19:58:29.7246724Z 2025-05-07T19:58:29.7248329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7250832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7251944Z ^ 2025-05-07T19:58:29.7252222Z 2025-05-07T19:58:29.7252645Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:29.7253290Z 2025-05-07T19:58:29.7254806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7257487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7258553Z ^ 2025-05-07T19:58:29.7258861Z 2025-05-07T19:58:29.7260352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7262962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7264166Z ^ 2025-05-07T19:58:29.7264425Z 2025-05-07T19:58:29.7264874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.7265537Z 2025-05-07T19:58:29.7267158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7269776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7271184Z ^ 2025-05-07T19:58:29.7271575Z 2025-05-07T19:58:29.7273098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7275767Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7276933Z ^ 2025-05-07T19:58:29.7277218Z 2025-05-07T19:58:29.7277666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.7278325Z 2025-05-07T19:58:29.7280009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7282681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7283668Z ^ 2025-05-07T19:58:29.7284302Z 2025-05-07T19:58:29.7285690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7288150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7289193Z ^ 2025-05-07T19:58:29.7289453Z 2025-05-07T19:58:29.7290097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.7290720Z 2025-05-07T19:58:29.7292191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.7294612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.7295706Z ^ 2025-05-07T19:58:29.7296058Z 2025-05-07T19:58:32.8681233Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:32.8704296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8707455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8708645Z ^ 2025-05-07T19:58:32.8708904Z 2025-05-07T19:58:32.8709353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.8710053Z 2025-05-07T19:58:32.8711880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8714541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8715721Z ^ 2025-05-07T19:58:32.8716104Z 2025-05-07T19:58:32.8717447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8719730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:32.8720702Z ^ 2025-05-07T19:58:32.8720969Z 2025-05-07T19:58:32.8721415Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.8722072Z 2025-05-07T19:58:32.8723774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8726470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8727685Z ^ 2025-05-07T19:58:32.8728049Z 2025-05-07T19:58:32.8729708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8732405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8733602Z ^ 2025-05-07T19:58:32.8733857Z 2025-05-07T19:58:32.8734311Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.8735017Z 2025-05-07T19:58:32.8736814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8739542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8740751Z ^ 2025-05-07T19:58:32.8741141Z 2025-05-07T19:58:32.8742808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:32.8745530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8746710Z ^ 2025-05-07T19:58:32.8746976Z 2025-05-07T19:58:32.8747454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.8748122Z 2025-05-07T19:58:32.8749809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8752769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8753985Z ^ 2025-05-07T19:58:32.8754349Z 2025-05-07T19:58:32.8756129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8758840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8760046Z ^ 2025-05-07T19:58:32.8760309Z 2025-05-07T19:58:32.8760755Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.8761450Z 2025-05-07T19:58:32.8763150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.8765869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.8767069Z ^ 2025-05-07T19:58:32.8767443Z 2025-05-07T19:58:34.3819245Z [296/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:34.3841785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3844440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3845465Z ^ 2025-05-07T19:58:34.3846071Z 2025-05-07T19:58:34.3846491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.3847092Z 2025-05-07T19:58:34.3848663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3851234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3852348Z ^ 2025-05-07T19:58:34.3852698Z 2025-05-07T19:58:34.3854306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:34.3857038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3858073Z ^ 2025-05-07T19:58:34.3858300Z 2025-05-07T19:58:34.3858734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.3859338Z 2025-05-07T19:58:34.3860988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3863468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3864551Z ^ 2025-05-07T19:58:34.3864882Z 2025-05-07T19:58:34.3866438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3869044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3870471Z ^ 2025-05-07T19:58:34.3870728Z 2025-05-07T19:58:34.3871151Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.3871822Z 2025-05-07T19:58:34.3873358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3875677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3876749Z ^ 2025-05-07T19:58:34.3877113Z 2025-05-07T19:58:34.3878676Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3881203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3882671Z ^ 2025-05-07T19:58:34.3882912Z 2025-05-07T19:58:34.3883353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.3883984Z 2025-05-07T19:58:34.3885492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3890673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3891960Z ^ 2025-05-07T19:58:34.3892336Z 2025-05-07T19:58:34.3893923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3896296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3897211Z ^ 2025-05-07T19:58:34.3897424Z 2025-05-07T19:58:34.3897774Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.3898344Z 2025-05-07T19:58:34.3899944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.3902595Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.3903789Z ^ 2025-05-07T19:58:34.3904156Z 2025-05-07T19:58:37.7867684Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:58:37.7891917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7894614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7895797Z ^ 2025-05-07T19:58:37.7896069Z 2025-05-07T19:58:37.7896639Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.7897323Z 2025-05-07T19:58:37.7899016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7901728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7902936Z ^ 2025-05-07T19:58:37.7903296Z 2025-05-07T19:58:37.7904971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7907695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7908922Z ^ 2025-05-07T19:58:37.7909182Z 2025-05-07T19:58:37.7909636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.7910346Z 2025-05-07T19:58:37.7912030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7914767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7915934Z ^ 2025-05-07T19:58:37.7916333Z 2025-05-07T19:58:37.7917833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7920375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7921549Z ^ 2025-05-07T19:58:37.7921834Z 2025-05-07T19:58:37.7922284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.7922946Z 2025-05-07T19:58:37.7924494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7927123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7928311Z ^ 2025-05-07T19:58:37.7928640Z 2025-05-07T19:58:37.7930207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7933178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7934377Z ^ 2025-05-07T19:58:37.7934639Z 2025-05-07T19:58:37.7935086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.7935771Z 2025-05-07T19:58:37.7937672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7940208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7941346Z ^ 2025-05-07T19:58:37.7941751Z 2025-05-07T19:58:37.7943385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7945983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7947137Z ^ 2025-05-07T19:58:37.7947384Z 2025-05-07T19:58:37.7947851Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.7948506Z 2025-05-07T19:58:37.7950149Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7952698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.7953724Z ^ 2025-05-07T19:58:37.7954055Z 2025-05-07T19:58:42.6682634Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:42.6706750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6709488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6710680Z ^ 2025-05-07T19:58:42.6710964Z 2025-05-07T19:58:42.6711420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.6712109Z 2025-05-07T19:58:42.6713829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:42.6716545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6717773Z ^ 2025-05-07T19:58:42.6718147Z 2025-05-07T19:58:42.6719833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6722520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6723721Z ^ 2025-05-07T19:58:42.6723979Z 2025-05-07T19:58:42.6724437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.6725131Z 2025-05-07T19:58:42.6726806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6729507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6730702Z ^ 2025-05-07T19:58:42.6731123Z 2025-05-07T19:58:42.6732799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6735525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6736843Z ^ 2025-05-07T19:58:42.6737129Z 2025-05-07T19:58:42.6737591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.6738257Z 2025-05-07T19:58:42.6739940Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6742676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6744088Z ^ 2025-05-07T19:58:42.6744467Z 2025-05-07T19:58:42.6746133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6748918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6750068Z ^ 2025-05-07T19:58:42.6750325Z 2025-05-07T19:58:42.6750734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.6751356Z 2025-05-07T19:58:42.6753069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6755477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6756533Z ^ 2025-05-07T19:58:42.6756886Z 2025-05-07T19:58:42.6758509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6761139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6762303Z ^ 
2025-05-07T19:58:42.6762550Z 2025-05-07T19:58:42.6762998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.6763675Z 2025-05-07T19:58:42.6765294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.6767789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.6768916Z ^ 2025-05-07T19:58:42.6769274Z 2025-05-07T19:58:44.7248074Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:44.7270418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7273051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7274126Z ^ 2025-05-07T19:58:44.7274383Z 2025-05-07T19:58:44.7274839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.7275469Z 2025-05-07T19:58:44.7277085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7279717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7280855Z ^ 2025-05-07T19:58:44.7281236Z 2025-05-07T19:58:44.7282790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7285268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7286262Z ^ 2025-05-07T19:58:44.7286464Z 2025-05-07T19:58:44.7286886Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.7287530Z 2025-05-07T19:58:44.7289066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7291693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7292825Z ^ 2025-05-07T19:58:44.7293179Z 2025-05-07T19:58:44.7294772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7297504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7298614Z ^ 2025-05-07T19:58:44.7298841Z 2025-05-07T19:58:44.7299276Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:44.7299944Z 2025-05-07T19:58:44.7301821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7304513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7305668Z ^ 2025-05-07T19:58:44.7305982Z 2025-05-07T19:58:44.7307844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7310288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7311268Z ^ 2025-05-07T19:58:44.7311468Z 2025-05-07T19:58:44.7311880Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.7312471Z 2025-05-07T19:58:44.7313983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7316517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7317658Z ^ 2025-05-07T19:58:44.7318021Z 2025-05-07T19:58:44.7319568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7322137Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7323230Z ^ 2025-05-07T19:58:44.7323494Z 2025-05-07T19:58:44.7323927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.7324552Z 2025-05-07T19:58:44.7326191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.7328768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.7329933Z ^ 2025-05-07T19:58:44.7330273Z 2025-05-07T19:58:49.1729713Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:49.1755228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1758238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1759537Z ^ 2025-05-07T19:58:49.1759815Z 2025-05-07T19:58:49.1760296Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.1761025Z 
2025-05-07T19:58:49.1762820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1765719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1766975Z ^ 2025-05-07T19:58:49.1767390Z 2025-05-07T19:58:49.1769163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1772364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1773637Z ^ 2025-05-07T19:58:49.1773938Z 2025-05-07T19:58:49.1774424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.1775096Z 2025-05-07T19:58:49.1777007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1779777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1781084Z ^ 2025-05-07T19:58:49.1781472Z 2025-05-07T19:58:49.1783222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1786033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:49.1787697Z ^ 2025-05-07T19:58:49.1787972Z 2025-05-07T19:58:49.1788446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.1789179Z 2025-05-07T19:58:49.1790956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1794110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1795396Z ^ 2025-05-07T19:58:49.1795802Z 2025-05-07T19:58:49.1797551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1800406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1801671Z ^ 2025-05-07T19:58:49.1801950Z 2025-05-07T19:58:49.1802438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.1803147Z 2025-05-07T19:58:49.1804931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1807839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1809142Z ^ 2025-05-07T19:58:49.1809531Z 2025-05-07T19:58:49.1811286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:49.1814175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1815466Z ^ 2025-05-07T19:58:49.1815734Z 2025-05-07T19:58:49.1816361Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.1817083Z 2025-05-07T19:58:49.1818886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.1821760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.1823067Z ^ 2025-05-07T19:58:49.1823459Z 2025-05-07T19:58:50.2205284Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:50.2230815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2233758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:50.2235050Z ^ 2025-05-07T19:58:50.2235330Z 2025-05-07T19:58:50.2235802Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2236537Z 2025-05-07T19:58:50.2238343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2241244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2242533Z ^ 2025-05-07T19:58:50.2242945Z 2025-05-07T19:58:50.2244678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2246900Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2247505Z ^ 2025-05-07T19:58:50.2247836Z 2025-05-07T19:58:50.2249571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2251790Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2252420Z ^ 2025-05-07T19:58:50.2252747Z 2025-05-07T19:58:50.2254486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2257015Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2257736Z ^ 2025-05-07T19:58:50.2258059Z 2025-05-07T19:58:50.2259818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2262687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2263969Z ^ 2025-05-07T19:58:50.2264421Z 2025-05-07T19:58:50.2264906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2265647Z 2025-05-07T19:58:50.2267420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2270607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2271900Z ^ 2025-05-07T19:58:50.2272301Z 2025-05-07T19:58:50.2274023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2276207Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2276829Z ^ 2025-05-07T19:58:50.2277152Z 2025-05-07T19:58:50.2278876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2281007Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2281642Z ^ 2025-05-07T19:58:50.2281966Z 2025-05-07T19:58:50.2283708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2285878Z if (output_d >= 0 && 
output_d < D) { 2025-05-07T19:58:50.2286497Z ^ 2025-05-07T19:58:50.2286820Z 2025-05-07T19:58:50.2288578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2291053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2292132Z ^ 2025-05-07T19:58:50.2292432Z 2025-05-07T19:58:50.2292905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2293613Z 2025-05-07T19:58:50.2295343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2298176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2299478Z ^ 2025-05-07T19:58:50.2299870Z 2025-05-07T19:58:50.2301614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2303643Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2304647Z ^ 2025-05-07T19:58:50.2304972Z 2025-05-07T19:58:50.2306710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2308816Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2309339Z ^ 2025-05-07T19:58:50.2309667Z 2025-05-07T19:58:50.2311652Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2313729Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2314353Z ^ 2025-05-07T19:58:50.2314675Z 2025-05-07T19:58:50.2316438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2319184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2320466Z ^ 2025-05-07T19:58:50.2320740Z 2025-05-07T19:58:50.2321218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2321926Z 2025-05-07T19:58:50.2323729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2326455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2327754Z ^ 2025-05-07T19:58:50.2328122Z 2025-05-07T19:58:50.2329763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2331933Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2332557Z ^ 2025-05-07T19:58:50.2332880Z 2025-05-07T19:58:50.2334512Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2336875Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2337501Z ^ 2025-05-07T19:58:50.2337825Z 2025-05-07T19:58:50.2339541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2341584Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2342209Z ^ 2025-05-07T19:58:50.2342534Z 2025-05-07T19:58:50.2344299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2346926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2348183Z ^ 2025-05-07T19:58:50.2348473Z 2025-05-07T19:58:50.2348870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2349595Z 2025-05-07T19:58:50.2351583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2354548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2355764Z ^ 2025-05-07T19:58:50.2356157Z 2025-05-07T19:58:50.2358061Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2360231Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2360703Z ^ 2025-05-07T19:58:50.2361033Z 2025-05-07T19:58:50.2362756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2364937Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2365563Z ^ 2025-05-07T19:58:50.2365838Z 2025-05-07T19:58:50.2367501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2369591Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2370500Z ^ 2025-05-07T19:58:50.2370831Z 2025-05-07T19:58:50.5674729Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:50.5699354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:58:50.5702644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5703946Z ^ 2025-05-07T19:58:50.5704148Z 2025-05-07T19:58:50.5704558Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.5705290Z 2025-05-07T19:58:50.5707069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5709857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5711143Z ^ 2025-05-07T19:58:50.5711554Z 2025-05-07T19:58:50.5713321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5716045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5717310Z ^ 2025-05-07T19:58:50.5717599Z 2025-05-07T19:58:50.5718035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.5718713Z 2025-05-07T19:58:50.5720568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5723405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5724599Z ^ 2025-05-07T19:58:50.5724991Z 2025-05-07T19:58:50.5726775Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5729514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5730807Z ^ 2025-05-07T19:58:50.5731016Z 2025-05-07T19:58:50.5731413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.5732141Z 2025-05-07T19:58:50.5733841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5736931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5738227Z ^ 2025-05-07T19:58:50.5738632Z 2025-05-07T19:58:50.5740156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5743163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5744506Z ^ 2025-05-07T19:58:50.5744799Z 2025-05-07T19:58:50.5745274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.5745984Z 2025-05-07T19:58:50.5747770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5750822Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5752121Z ^ 2025-05-07T19:58:50.5752514Z 2025-05-07T19:58:50.5754260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5757135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5758409Z ^ 2025-05-07T19:58:50.5758683Z 2025-05-07T19:58:50.5759158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.5759890Z 2025-05-07T19:58:50.5761673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.5764553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.5765823Z ^ 2025-05-07T19:58:50.5766240Z 2025-05-07T19:58:52.1045256Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:52.1068206Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1070953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1072011Z ^ 2025-05-07T19:58:52.1072225Z 2025-05-07T19:58:52.1072646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.1073299Z 2025-05-07T19:58:52.1074753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1077144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1078233Z ^ 2025-05-07T19:58:52.1078485Z 2025-05-07T19:58:52.1079728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1082060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1083105Z ^ 2025-05-07T19:58:52.1083324Z 2025-05-07T19:58:52.1083700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.1084274Z 2025-05-07T19:58:52.1085831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1088358Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1089497Z ^ 2025-05-07T19:58:52.1089832Z 2025-05-07T19:58:52.1091364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1093918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1094943Z ^ 2025-05-07T19:58:52.1095191Z 2025-05-07T19:58:52.1095638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.1096419Z 2025-05-07T19:58:52.1097884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1100323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1101251Z ^ 2025-05-07T19:58:52.1101912Z 2025-05-07T19:58:52.1103347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1105655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1106602Z ^ 2025-05-07T19:58:52.1106793Z 2025-05-07T19:58:52.1107387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.1107970Z 2025-05-07T19:58:52.1109531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1111871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1112974Z ^ 2025-05-07T19:58:52.1113298Z 2025-05-07T19:58:52.1114859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1117395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1118400Z ^ 2025-05-07T19:58:52.1118594Z 2025-05-07T19:58:52.1119012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.1119643Z 2025-05-07T19:58:52.1121158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.1123530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.1124436Z ^ 2025-05-07T19:58:52.1124795Z 2025-05-07T19:58:55.0620333Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:55.0640522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0643061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0644113Z ^ 2025-05-07T19:58:55.0644356Z 2025-05-07T19:58:55.0644718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.0645279Z 2025-05-07T19:58:55.0646693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0649076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0650117Z ^ 2025-05-07T19:58:55.0650487Z 2025-05-07T19:58:55.0651874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0654143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0655180Z ^ 2025-05-07T19:58:55.0655429Z 2025-05-07T19:58:55.0655840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.0656583Z 2025-05-07T19:58:55.0658026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0660393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0661490Z ^ 2025-05-07T19:58:55.0661823Z 2025-05-07T19:58:55.0663251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0665596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0666689Z ^ 2025-05-07T19:58:55.0681514Z 2025-05-07T19:58:55.0682017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.0682609Z 2025-05-07T19:58:55.0683998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0686629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0687696Z ^ 2025-05-07T19:58:55.0688013Z 2025-05-07T19:58:55.0689487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0692127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0693164Z ^ 2025-05-07T19:58:55.0693403Z 2025-05-07T19:58:55.0693767Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:55.0694327Z 2025-05-07T19:58:55.0695677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0698070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0698963Z ^ 2025-05-07T19:58:55.0699278Z 2025-05-07T19:58:55.0700629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0702889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0703728Z ^ 2025-05-07T19:58:55.0703943Z 2025-05-07T19:58:55.0704283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.0704797Z 2025-05-07T19:58:55.0706180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.0708448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.0709474Z ^ 2025-05-07T19:58:55.0709798Z 2025-05-07T19:58:55.2908311Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:55.2929292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2931761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2932839Z ^ 2025-05-07T19:58:55.2933078Z 2025-05-07T19:58:55.2933471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2934035Z 2025-05-07T19:58:55.2935448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2938054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2939126Z ^ 2025-05-07T19:58:55.2939463Z 2025-05-07T19:58:55.2940922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2943285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2944306Z ^ 2025-05-07T19:58:55.2944560Z 2025-05-07T19:58:55.2944963Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2945563Z 2025-05-07T19:58:55.2947052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2949425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2950444Z ^ 2025-05-07T19:58:55.2950770Z 2025-05-07T19:58:55.2952262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2954600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2955618Z ^ 2025-05-07T19:58:55.2956080Z 2025-05-07T19:58:55.2956579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2957170Z 2025-05-07T19:58:55.2958625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2961050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2962314Z ^ 2025-05-07T19:58:55.2962668Z 2025-05-07T19:58:55.2964085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2966430Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2967501Z ^ 2025-05-07T19:58:55.2967744Z 2025-05-07T19:58:55.2968152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2968755Z 2025-05-07T19:58:55.2970464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2973015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2974076Z ^ 2025-05-07T19:58:55.2974385Z 2025-05-07T19:58:55.2975824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2978310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2979340Z ^ 2025-05-07T19:58:55.2979566Z 2025-05-07T19:58:55.2979968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2980565Z 2025-05-07T19:58:55.2982027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2984410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2985468Z ^ 2025-05-07T19:58:55.2985817Z 2025-05-07T19:58:55.7949234Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:55.7970527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.7972878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.7973883Z ^ 2025-05-07T19:58:55.7974140Z 2025-05-07T19:58:55.7974483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.7974994Z 2025-05-07T19:58:55.7976217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.7978351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.7979414Z ^ 2025-05-07T19:58:55.7979718Z 2025-05-07T19:58:55.7981137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.7983427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:55.7984437Z ^ 2025-05-07T19:58:55.7984668Z 2025-05-07T19:58:55.7985033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.7985606Z 2025-05-07T19:58:55.7987022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.7989418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.7990443Z ^ 2025-05-07T19:58:55.7990766Z 2025-05-07T19:58:55.7992189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.7994905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.7995826Z ^ 2025-05-07T19:58:55.7996076Z 2025-05-07T19:58:55.7996436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.7996969Z 2025-05-07T19:58:55.7998736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8001090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8002183Z ^ 2025-05-07T19:58:55.8002517Z 2025-05-07T19:58:55.8003994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:55.8006295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8007321Z ^ 2025-05-07T19:58:55.8007555Z 2025-05-07T19:58:55.8007969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8008511Z 2025-05-07T19:58:55.8009809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8012162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8013205Z ^ 2025-05-07T19:58:55.8013565Z 2025-05-07T19:58:55.8014877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8017371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8018342Z ^ 2025-05-07T19:58:55.8018577Z 2025-05-07T19:58:55.8019021Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.8019607Z 2025-05-07T19:58:55.8021032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.8023462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.8024542Z ^ 2025-05-07T19:58:55.8024878Z 2025-05-07T19:58:57.1804808Z [307/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:57.1826744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1829216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1830230Z ^ 2025-05-07T19:58:57.1830482Z 2025-05-07T19:58:57.1830886Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.1831494Z 2025-05-07T19:58:57.1833045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1835337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1836387Z ^ 2025-05-07T19:58:57.1836729Z 2025-05-07T19:58:57.1838219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1840642Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1841700Z ^ 2025-05-07T19:58:57.1841909Z 2025-05-07T19:58:57.1842397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.1843018Z 2025-05-07T19:58:57.1844577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1847025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1848331Z ^ 2025-05-07T19:58:57.1848679Z 2025-05-07T19:58:57.1850115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1852570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1853636Z ^ 2025-05-07T19:58:57.1853875Z 2025-05-07T19:58:57.1854400Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.1855004Z 2025-05-07T19:58:57.1856654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1859052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1860093Z ^ 2025-05-07T19:58:57.1860381Z 2025-05-07T19:58:57.1861925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1864415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1865498Z ^ 2025-05-07T19:58:57.1865732Z 2025-05-07T19:58:57.1866178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.1866796Z 2025-05-07T19:58:57.1868348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1871074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1872204Z ^ 2025-05-07T19:58:57.1872570Z 2025-05-07T19:58:57.1874156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1876636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.1877701Z ^ 2025-05-07T19:58:57.1877941Z 2025-05-07T19:58:57.1878346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.1878962Z 2025-05-07T19:58:57.1880494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.1883005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:57.1884119Z ^ 2025-05-07T19:58:57.1884454Z 2025-05-07T19:58:57.2457442Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:57.2479855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2482426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2483500Z ^ 2025-05-07T19:58:57.2483761Z 2025-05-07T19:58:57.2484205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.2484812Z 2025-05-07T19:58:57.2486361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2488843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2489972Z ^ 2025-05-07T19:58:57.2490284Z 2025-05-07T19:58:57.2491664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2494058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2495153Z ^ 2025-05-07T19:58:57.2495394Z 2025-05-07T19:58:57.2495818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.2496557Z 2025-05-07T19:58:57.2498073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2500859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2501921Z ^ 2025-05-07T19:58:57.2502206Z 2025-05-07T19:58:57.2503861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2506339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2507457Z ^ 2025-05-07T19:58:57.2507693Z 2025-05-07T19:58:57.2508127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.2508762Z 2025-05-07T19:58:57.2510340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2512663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2513722Z ^ 
2025-05-07T19:58:57.2514060Z 2025-05-07T19:58:57.2515586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2518065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2519166Z ^ 2025-05-07T19:58:57.2519426Z 2025-05-07T19:58:57.2519858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.2520461Z 2025-05-07T19:58:57.2521862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2524276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2525396Z ^ 2025-05-07T19:58:57.2525741Z 2025-05-07T19:58:57.2527241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.2529729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2530788Z ^ 2025-05-07T19:58:57.2530999Z 2025-05-07T19:58:57.2531377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:57.2531981Z 2025-05-07T19:58:57.2533524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:57.2536020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:57.2537194Z ^ 2025-05-07T19:58:57.2537539Z 2025-05-07T19:58:59.6976794Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:59.6999761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7002410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7003483Z ^ 2025-05-07T19:58:59.7003686Z 2025-05-07T19:58:59.7004113Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.7004729Z 2025-05-07T19:58:59.7006304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7008864Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7009917Z ^ 2025-05-07T19:58:59.7010240Z 2025-05-07T19:58:59.7011763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7014267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7015595Z ^ 2025-05-07T19:58:59.7015844Z 2025-05-07T19:58:59.7016262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.7017029Z 2025-05-07T19:58:59.7018725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7021487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7022568Z ^ 2025-05-07T19:58:59.7022915Z 2025-05-07T19:58:59.7024555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7027180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7028299Z ^ 2025-05-07T19:58:59.7028547Z 2025-05-07T19:58:59.7029001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.7029654Z 2025-05-07T19:58:59.7031278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7033876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7034974Z ^ 2025-05-07T19:58:59.7035324Z 2025-05-07T19:58:59.7036740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7039268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7040366Z ^ 2025-05-07T19:58:59.7040622Z 2025-05-07T19:58:59.7041052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.7041665Z 2025-05-07T19:58:59.7043262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7045877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7047063Z ^ 2025-05-07T19:58:59.7047369Z 2025-05-07T19:58:59.7048903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7051505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7052663Z ^ 2025-05-07T19:58:59.7052907Z 2025-05-07T19:58:59.7053353Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:59.7054004Z 2025-05-07T19:58:59.7055645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.7058346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.7059783Z ^ 2025-05-07T19:58:59.7060170Z 2025-05-07T19:59:01.7910856Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:01.7930936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7933422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7934430Z ^ 2025-05-07T19:59:01.7934649Z 2025-05-07T19:59:01.7935052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.7935639Z 2025-05-07T19:59:01.7937194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:59:01.7939493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7940607Z ^ 2025-05-07T19:59:01.7940965Z 2025-05-07T19:59:01.7942585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7945623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7946762Z ^ 2025-05-07T19:59:01.7947018Z 2025-05-07T19:59:01.7947446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.7947956Z 2025-05-07T19:59:01.7949440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7951432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7952314Z ^ 2025-05-07T19:59:01.7952605Z 2025-05-07T19:59:01.7953814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7955879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7956801Z ^ 2025-05-07T19:59:01.7957006Z 2025-05-07T19:59:01.7957371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.7957970Z 2025-05-07T19:59:01.7959361Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7961738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7962794Z ^ 2025-05-07T19:59:01.7963122Z 2025-05-07T19:59:01.7964325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7966681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7967708Z ^ 2025-05-07T19:59:01.7967937Z 2025-05-07T19:59:01.7968315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.7968822Z 2025-05-07T19:59:01.7970537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7972731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7973687Z ^ 2025-05-07T19:59:01.7974015Z 2025-05-07T19:59:01.7975550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7977845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7978819Z ^ 
2025-05-07T19:59:01.7979043Z 2025-05-07T19:59:01.7979424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:01.7979988Z 2025-05-07T19:59:01.7981618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.7984037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:01.7985045Z ^ 2025-05-07T19:59:01.7985323Z 2025-05-07T19:59:02.1425754Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:02.1447774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1450422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1451578Z ^ 2025-05-07T19:59:02.1451821Z 2025-05-07T19:59:02.1452300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1452956Z 2025-05-07T19:59:02.1454508Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1457155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1458471Z ^ 2025-05-07T19:59:02.1458836Z 2025-05-07T19:59:02.1460353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1462978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1463974Z ^ 2025-05-07T19:59:02.1464204Z 2025-05-07T19:59:02.1464604Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1465223Z 2025-05-07T19:59:02.1474595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1477139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1478187Z ^ 2025-05-07T19:59:02.1478520Z 2025-05-07T19:59:02.1480056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1482550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1483659Z ^ 
2025-05-07T19:59:02.1483942Z 2025-05-07T19:59:02.1484383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1485031Z 2025-05-07T19:59:02.1486721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1489336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1490461Z ^ 2025-05-07T19:59:02.1490782Z 2025-05-07T19:59:02.1492358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1494700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1495738Z ^ 2025-05-07T19:59:02.1495992Z 2025-05-07T19:59:02.1496572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1497217Z 2025-05-07T19:59:02.1498646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1501069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1502209Z ^ 2025-05-07T19:59:02.1502565Z 2025-05-07T19:59:02.1504172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:02.1506741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1508222Z ^ 2025-05-07T19:59:02.1508478Z 2025-05-07T19:59:02.1508930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1509581Z 2025-05-07T19:59:02.1511072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1513596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1514694Z ^ 2025-05-07T19:59:02.1515033Z 2025-05-07T19:59:02.1604187Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:02.1626571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1629204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1630313Z ^ 2025-05-07T19:59:02.1630525Z 
2025-05-07T19:59:02.1630922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1631842Z 2025-05-07T19:59:02.1633464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1635886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1636987Z ^ 2025-05-07T19:59:02.1637379Z 2025-05-07T19:59:02.1639018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1641457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1642569Z ^ 2025-05-07T19:59:02.1642858Z 2025-05-07T19:59:02.1643454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1644105Z 2025-05-07T19:59:02.1645730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1648354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1649561Z ^ 2025-05-07T19:59:02.1649921Z 2025-05-07T19:59:02.1651479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1653832Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1654929Z ^ 2025-05-07T19:59:02.1655168Z 2025-05-07T19:59:02.1655600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1656247Z 2025-05-07T19:59:02.1657915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1660464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1661645Z ^ 2025-05-07T19:59:02.1662040Z 2025-05-07T19:59:02.1663687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1666340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1667479Z ^ 2025-05-07T19:59:02.1667707Z 2025-05-07T19:59:02.1668137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1668780Z 2025-05-07T19:59:02.1670512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1672985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1674131Z ^ 2025-05-07T19:59:02.1674493Z 2025-05-07T19:59:02.1676153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1678753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1679905Z ^ 2025-05-07T19:59:02.1680163Z 2025-05-07T19:59:02.1680597Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.1681249Z 2025-05-07T19:59:02.1683011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.1685615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.1686830Z ^ 2025-05-07T19:59:02.1687314Z 2025-05-07T19:59:04.5069238Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:04.5090870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5093454Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5094987Z ^ 2025-05-07T19:59:04.5095225Z 2025-05-07T19:59:04.5095619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.5096235Z 2025-05-07T19:59:04.5097917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5100524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5101600Z ^ 2025-05-07T19:59:04.5101979Z 2025-05-07T19:59:04.5103398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5105921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5107058Z ^ 2025-05-07T19:59:04.5107319Z 2025-05-07T19:59:04.5107748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.5108399Z 2025-05-07T19:59:04.5110129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5112717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5113625Z ^ 2025-05-07T19:59:04.5113907Z 2025-05-07T19:59:04.5115160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5117165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5118063Z ^ 2025-05-07T19:59:04.5118260Z 2025-05-07T19:59:04.5118610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.5119133Z 2025-05-07T19:59:04.5120390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5122581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5123472Z ^ 2025-05-07T19:59:04.5123819Z 2025-05-07T19:59:04.5125310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5127706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5128811Z ^ 2025-05-07T19:59:04.5129063Z 2025-05-07T19:59:04.5129494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.5130139Z 2025-05-07T19:59:04.5131659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5134274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:04.5135470Z ^ 2025-05-07T19:59:04.5135788Z 2025-05-07T19:59:04.5137430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5140102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5141078Z ^ 2025-05-07T19:59:04.5141295Z 2025-05-07T19:59:04.5141675Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.5142262Z 2025-05-07T19:59:04.5143855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.5146318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.5147269Z ^ 2025-05-07T19:59:04.5147583Z 2025-05-07T19:59:09.0685808Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:09.0710678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:59:09.0714254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0715551Z ^ 2025-05-07T19:59:09.0715831Z 2025-05-07T19:59:09.0716309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.0717040Z 2025-05-07T19:59:09.0719028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0721958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0723265Z ^ 2025-05-07T19:59:09.0723674Z 2025-05-07T19:59:09.0725371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0728145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0729289Z ^ 2025-05-07T19:59:09.0729530Z 2025-05-07T19:59:09.0730034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.0730747Z 2025-05-07T19:59:09.0732382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0735317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0736749Z ^ 2025-05-07T19:59:09.0737036Z 2025-05-07T19:59:09.0738783Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0741489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0742783Z ^ 2025-05-07T19:59:09.0743058Z 2025-05-07T19:59:09.0743536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.0744269Z 2025-05-07T19:59:09.0745993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0748881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0750040Z ^ 2025-05-07T19:59:09.0750437Z 2025-05-07T19:59:09.0752225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0754955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0756250Z ^ 2025-05-07T19:59:09.0756525Z 2025-05-07T19:59:09.0757014Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.0757984Z 2025-05-07T19:59:09.0759620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0762523Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0763666Z ^ 2025-05-07T19:59:09.0764062Z 2025-05-07T19:59:09.0765961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0768651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0769888Z ^ 2025-05-07T19:59:09.0770511Z 2025-05-07T19:59:09.0771208Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.0771924Z 2025-05-07T19:59:09.0773699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.0776705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.0778017Z ^ 2025-05-07T19:59:09.0778408Z 2025-05-07T19:59:24.0314542Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:24.0340094Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0343201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0344448Z ^ 2025-05-07T19:59:24.0344711Z 2025-05-07T19:59:24.0345170Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.0345883Z 2025-05-07T19:59:24.0347774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0350672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0351939Z ^ 2025-05-07T19:59:24.0352326Z 2025-05-07T19:59:24.0354054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0356847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0357927Z ^ 2025-05-07T19:59:24.0358173Z 2025-05-07T19:59:24.0358638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.0359327Z 2025-05-07T19:59:24.0360977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0363741Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0364997Z ^ 2025-05-07T19:59:24.0365383Z 2025-05-07T19:59:24.0367082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0369837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0371331Z ^ 2025-05-07T19:59:24.0371604Z 2025-05-07T19:59:24.0372073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.0372705Z 2025-05-07T19:59:24.0374382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0377224Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0378475Z ^ 2025-05-07T19:59:24.0378857Z 2025-05-07T19:59:24.0380592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0383623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0384859Z ^ 2025-05-07T19:59:24.0385123Z 2025-05-07T19:59:24.0385582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.0386280Z 2025-05-07T19:59:24.0388131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0390957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0392194Z ^ 2025-05-07T19:59:24.0392594Z 2025-05-07T19:59:24.0394428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0397209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0398439Z ^ 2025-05-07T19:59:24.0398727Z 2025-05-07T19:59:24.0399190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.0399859Z 2025-05-07T19:59:24.0401633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.0404465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.0405736Z ^ 2025-05-07T19:59:24.0406134Z 2025-05-07T19:59:25.0366488Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:25.0388168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0390644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0391650Z ^ 2025-05-07T19:59:25.0392079Z 2025-05-07T19:59:25.0392495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.0393129Z 2025-05-07T19:59:25.0394647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0397060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0398111Z ^ 2025-05-07T19:59:25.0398444Z 2025-05-07T19:59:25.0399912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0402256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0403291Z ^ 2025-05-07T19:59:25.0403533Z 2025-05-07T19:59:25.0403937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.0404554Z 2025-05-07T19:59:25.0406027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0408393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0409464Z ^ 2025-05-07T19:59:25.0409806Z 2025-05-07T19:59:25.0411266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0413606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0414635Z ^ 2025-05-07T19:59:25.0414879Z 2025-05-07T19:59:25.0415273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.0415866Z 2025-05-07T19:59:25.0417542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0420003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0421079Z ^ 2025-05-07T19:59:25.0421757Z 2025-05-07T19:59:25.0423280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0425783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0426870Z ^ 2025-05-07T19:59:25.0427111Z 2025-05-07T19:59:25.0427635Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:25.0428251Z 2025-05-07T19:59:25.0429750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0432284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0433417Z ^ 2025-05-07T19:59:25.0433782Z 2025-05-07T19:59:25.0435305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0437739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0438742Z ^ 2025-05-07T19:59:25.0438966Z 2025-05-07T19:59:25.0439361Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.0439832Z 2025-05-07T19:59:25.0441171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.0443524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.0444494Z ^ 2025-05-07T19:59:25.0444792Z 2025-05-07T19:59:26.5248696Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:26.5271827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5274407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5275508Z ^ 2025-05-07T19:59:26.5275746Z 2025-05-07T19:59:26.5276155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5276694Z 2025-05-07T19:59:26.5278173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5280389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5281401Z ^ 2025-05-07T19:59:26.5281726Z 2025-05-07T19:59:26.5283038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5285540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5286596Z ^ 2025-05-07T19:59:26.5286833Z 2025-05-07T19:59:26.5287251Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:59:26.5287875Z 2025-05-07T19:59:26.5289318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5291699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5292845Z ^ 2025-05-07T19:59:26.5293195Z 2025-05-07T19:59:26.5294761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5297439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5298552Z ^ 2025-05-07T19:59:26.5298799Z 2025-05-07T19:59:26.5299228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5299852Z 2025-05-07T19:59:26.5301273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5304003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5305126Z ^ 2025-05-07T19:59:26.5305460Z 2025-05-07T19:59:26.5307127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5309508Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5310633Z ^ 2025-05-07T19:59:26.5310878Z 2025-05-07T19:59:26.5311312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5311956Z 2025-05-07T19:59:26.5313650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5316209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5317342Z ^ 2025-05-07T19:59:26.5317713Z 2025-05-07T19:59:26.5319289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5321857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5322998Z ^ 2025-05-07T19:59:26.5323258Z 2025-05-07T19:59:26.5323707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5324351Z 2025-05-07T19:59:26.5325989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5328577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5329641Z ^ 2025-05-07T19:59:26.5329981Z 2025-05-07T19:59:29.2809409Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:29.2833403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2836051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2837165Z ^ 2025-05-07T19:59:29.2837415Z 2025-05-07T19:59:29.2837862Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:29.2838537Z 2025-05-07T19:59:29.2840205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2842930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2844105Z ^ 2025-05-07T19:59:29.2844471Z 2025-05-07T19:59:29.2846152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2848803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2849972Z ^ 2025-05-07T19:59:29.2850220Z 2025-05-07T19:59:29.2850679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:29.2851353Z 2025-05-07T19:59:29.2853039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2855728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2857041Z ^ 2025-05-07T19:59:29.2857401Z 2025-05-07T19:59:29.2859053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2861696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2863031Z ^ 2025-05-07T19:59:29.2863348Z 2025-05-07T19:59:29.2863752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:29.2864299Z 2025-05-07T19:59:29.2865970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2868728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2869903Z ^ 2025-05-07T19:59:29.2870482Z 2025-05-07T19:59:29.2872013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2874770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2875714Z ^ 2025-05-07T19:59:29.2875948Z 2025-05-07T19:59:29.2876399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:29.2877055Z 2025-05-07T19:59:29.2878742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2881378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2882542Z ^ 2025-05-07T19:59:29.2882924Z 2025-05-07T19:59:29.2884589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2887209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2888350Z ^ 2025-05-07T19:59:29.2888616Z 2025-05-07T19:59:29.2889004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:29.2889626Z 2025-05-07T19:59:29.2891167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.2893751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:29.2894916Z ^ 2025-05-07T19:59:29.2895286Z 2025-05-07T19:59:31.5718400Z 
[319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:31.5739838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5742330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5743447Z ^ 2025-05-07T19:59:31.5743696Z 2025-05-07T19:59:31.5744164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.5744812Z 2025-05-07T19:59:31.5746436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5748981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5750148Z ^ 2025-05-07T19:59:31.5750503Z 2025-05-07T19:59:31.5752129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5754767Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5755905Z ^ 2025-05-07T19:59:31.5756158Z 2025-05-07T19:59:31.5756561Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.5757173Z 2025-05-07T19:59:31.5758693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5761291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5762368Z ^ 2025-05-07T19:59:31.5762709Z 2025-05-07T19:59:31.5764289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5767066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5768210Z ^ 2025-05-07T19:59:31.5768452Z 2025-05-07T19:59:31.5768876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.5769523Z 2025-05-07T19:59:31.5771529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5774021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5775131Z ^ 2025-05-07T19:59:31.5775508Z 2025-05-07T19:59:31.5777463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5780057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5781169Z ^ 2025-05-07T19:59:31.5781427Z 2025-05-07T19:59:31.5781871Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.5782521Z 2025-05-07T19:59:31.5784075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5786677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5787856Z ^ 2025-05-07T19:59:31.5788203Z 2025-05-07T19:59:31.5789784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5792364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.5793513Z ^ 2025-05-07T19:59:31.5793755Z 2025-05-07T19:59:31.5794152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.5794792Z 2025-05-07T19:59:31.5796369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.5798942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:59:31.5800028Z ^ 2025-05-07T19:59:31.5800405Z 2025-05-07T19:59:39.0259233Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:39.0282276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0284921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0286044Z ^ 2025-05-07T19:59:39.0286289Z 2025-05-07T19:59:39.0286736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:39.0287390Z 2025-05-07T19:59:39.0288964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0291493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0292539Z ^ 2025-05-07T19:59:39.0292898Z 2025-05-07T19:59:39.0294278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:59:39.0296407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0297570Z ^ 2025-05-07T19:59:39.0297806Z 2025-05-07T19:59:39.0298242Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:39.0298879Z 2025-05-07T19:59:39.0300391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0302935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0304445Z ^ 2025-05-07T19:59:39.0304792Z 2025-05-07T19:59:39.0306347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0308838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0310085Z ^ 2025-05-07T19:59:39.0310334Z 2025-05-07T19:59:39.0310770Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:39.0311409Z 2025-05-07T19:59:39.0313033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0315728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0316875Z ^ 2025-05-07T19:59:39.0317254Z 2025-05-07T19:59:39.0318859Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0321392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0322531Z ^ 2025-05-07T19:59:39.0322786Z 2025-05-07T19:59:39.0323219Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:39.0323875Z 2025-05-07T19:59:39.0325497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0327943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0329050Z ^ 2025-05-07T19:59:39.0329401Z 2025-05-07T19:59:39.0330995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0333470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0334599Z ^ 2025-05-07T19:59:39.0334850Z 2025-05-07T19:59:39.0335292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:39.0335967Z 2025-05-07T19:59:39.0337681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.0340249Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:39.0341375Z ^ 2025-05-07T19:59:39.0341758Z 2025-05-07T19:59:50.7817875Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:50.7841135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7843704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7844814Z ^ 2025-05-07T19:59:50.7845049Z 2025-05-07T19:59:50.7845469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:50.7846114Z 2025-05-07T19:59:50.7847523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7849927Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7851100Z ^ 2025-05-07T19:59:50.7851446Z 2025-05-07T19:59:50.7852961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7855014Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:50.7855777Z ^ 2025-05-07T19:59:50.7856065Z 2025-05-07T19:59:50.7857718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7859896Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7860438Z ^ 2025-05-07T19:59:50.7860715Z 2025-05-07T19:59:50.7862218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7864126Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7864676Z ^ 2025-05-07T19:59:50.7865012Z 2025-05-07T19:59:50.7866507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7868394Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7868922Z ^ 2025-05-07T19:59:50.7869219Z 2025-05-07T19:59:50.7871240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:50.7873878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7874993Z ^ 2025-05-07T19:59:50.7875244Z 2025-05-07T19:59:50.7875681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:50.7876325Z 2025-05-07T19:59:50.7877948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7880513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7881674Z ^ 2025-05-07T19:59:50.7882030Z 2025-05-07T19:59:50.7883527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7885586Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:50.7886308Z ^ 2025-05-07T19:59:50.7886598Z 2025-05-07T19:59:50.7888084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7889976Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7890519Z ^ 2025-05-07T19:59:50.7890798Z 2025-05-07T19:59:50.7892268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7894152Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7894666Z ^ 
2025-05-07T19:59:50.7894938Z 2025-05-07T19:59:50.7896428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7898398Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7898920Z ^ 2025-05-07T19:59:50.7899195Z 2025-05-07T19:59:50.7900755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7903531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7904645Z ^ 2025-05-07T19:59:50.7904876Z 2025-05-07T19:59:50.7905300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:50.7905929Z 2025-05-07T19:59:50.7907597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7909622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7910736Z ^ 2025-05-07T19:59:50.7911101Z 2025-05-07T19:59:50.7912600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7914553Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:50.7915330Z ^ 2025-05-07T19:59:50.7915610Z 2025-05-07T19:59:50.7916975Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7918913Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7919462Z ^ 2025-05-07T19:59:50.7919781Z 2025-05-07T19:59:50.7921265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7923217Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7923771Z ^ 2025-05-07T19:59:50.7924085Z 2025-05-07T19:59:50.7925548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7927425Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7927986Z ^ 2025-05-07T19:59:50.7928262Z 2025-05-07T19:59:50.7929891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7932476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7933684Z ^ 2025-05-07T19:59:50.7933945Z 2025-05-07T19:59:50.7934425Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:50.7935072Z 2025-05-07T19:59:50.7936791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7939345Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7940511Z ^ 2025-05-07T19:59:50.7940911Z 2025-05-07T19:59:50.7942393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7944678Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:50.7945413Z ^ 2025-05-07T19:59:50.7945728Z 2025-05-07T19:59:50.7947205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7949098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7949648Z ^ 2025-05-07T19:59:50.7950045Z 2025-05-07T19:59:50.7951541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7953408Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7953969Z ^ 2025-05-07T19:59:50.7954252Z 2025-05-07T19:59:50.7955731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7957438Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7957999Z ^ 2025-05-07T19:59:50.7958263Z 2025-05-07T19:59:50.7959885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:50.7962478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7963646Z ^ 2025-05-07T19:59:50.7963898Z 2025-05-07T19:59:50.7964346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:50.7965033Z 2025-05-07T19:59:50.7966654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:50.7969250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:50.7970653Z ^ 2025-05-07T19:59:50.7971030Z 2025-05-07T19:59:50.7972485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7974500Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:50.7975242Z ^ 2025-05-07T19:59:50.7975538Z 2025-05-07T19:59:50.7977140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7979019Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7979591Z ^ 2025-05-07T19:59:50.7979880Z 2025-05-07T19:59:50.7981358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7983226Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7983789Z ^ 
2025-05-07T19:59:50.7984079Z 2025-05-07T19:59:50.7985556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:50.7987761Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:50.7988319Z ^ 2025-05-07T19:59:50.7988568Z 2025-05-07T19:59:51.2417374Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:51.2440557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2443326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2444517Z ^ 2025-05-07T19:59:51.2444803Z 2025-05-07T19:59:51.2445257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.2445925Z 2025-05-07T19:59:51.2447618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2450070Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2451122Z ^ 2025-05-07T19:59:51.2451824Z 2025-05-07T19:59:51.2453250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2455321Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.2456066Z ^ 2025-05-07T19:59:51.2456343Z 2025-05-07T19:59:51.2458017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2459866Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2460442Z ^ 2025-05-07T19:59:51.2460757Z 2025-05-07T19:59:51.2462288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2464298Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2464867Z ^ 2025-05-07T19:59:51.2465176Z 2025-05-07T19:59:51.2466734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2468724Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2469291Z ^ 2025-05-07T19:59:51.2469575Z 2025-05-07T19:59:51.2471481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:51.2474166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2475382Z ^ 2025-05-07T19:59:51.2475651Z 2025-05-07T19:59:51.2476132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.2476811Z 2025-05-07T19:59:51.2478486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2481138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2482193Z ^ 2025-05-07T19:59:51.2482564Z 2025-05-07T19:59:51.2483870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2485820Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.2486508Z ^ 2025-05-07T19:59:51.2486807Z 2025-05-07T19:59:51.2488251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2490076Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2490592Z ^ 2025-05-07T19:59:51.2490871Z 2025-05-07T19:59:51.2492241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2494179Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2495021Z ^ 
2025-05-07T19:59:51.2495294Z 2025-05-07T19:59:51.2496993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2498953Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2499503Z ^ 2025-05-07T19:59:51.2499776Z 2025-05-07T19:59:51.2501548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2520136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2521198Z ^ 2025-05-07T19:59:51.2521452Z 2025-05-07T19:59:51.2521907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.2522658Z 2025-05-07T19:59:51.2524275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2526803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2528000Z ^ 2025-05-07T19:59:51.2528373Z 2025-05-07T19:59:51.2529956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2532084Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.2532870Z ^ 2025-05-07T19:59:51.2533157Z 2025-05-07T19:59:51.2534752Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2536842Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2537405Z ^ 2025-05-07T19:59:51.2537695Z 2025-05-07T19:59:51.2539267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2541201Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2541733Z ^ 2025-05-07T19:59:51.2541987Z 2025-05-07T19:59:51.2543377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2545056Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2545526Z ^ 2025-05-07T19:59:51.2545783Z 2025-05-07T19:59:51.2547215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2549674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2550734Z ^ 2025-05-07T19:59:51.2550967Z 2025-05-07T19:59:51.2551375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.2551922Z 2025-05-07T19:59:51.2553526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:51.2556304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2557491Z ^ 2025-05-07T19:59:51.2557852Z 2025-05-07T19:59:51.2559507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2561661Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.2562428Z ^ 2025-05-07T19:59:51.2562717Z 2025-05-07T19:59:51.2564357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2566324Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2566879Z ^ 2025-05-07T19:59:51.2567177Z 2025-05-07T19:59:51.2568741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2570950Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2571455Z ^ 2025-05-07T19:59:51.2571718Z 2025-05-07T19:59:51.2573086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2574784Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2575224Z ^ 2025-05-07T19:59:51.2575465Z 2025-05-07T19:59:51.2577069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:59:51.2579496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2580558Z ^ 2025-05-07T19:59:51.2580792Z 2025-05-07T19:59:51.2581221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.2581840Z 2025-05-07T19:59:51.2583374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:51.2585786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:51.2586874Z ^ 2025-05-07T19:59:51.2587227Z 2025-05-07T19:59:51.2588752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2590762Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.2591505Z ^ 2025-05-07T19:59:51.2591793Z 2025-05-07T19:59:51.2593359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2595310Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2596210Z ^ 2025-05-07T19:59:51.2596520Z 2025-05-07T19:59:51.2598081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2600041Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:59:51.2600583Z ^ 2025-05-07T19:59:51.2600863Z 2025-05-07T19:59:51.2602534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.2604494Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.2605052Z ^ 2025-05-07T19:59:51.2605334Z 2025-05-07T19:59:52.3413318Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:59:54.4869409Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T19:59:54.4891655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4894166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4895204Z ^ 2025-05-07T19:59:54.4895465Z 2025-05-07T19:59:54.4895889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:54.4896684Z 2025-05-07T19:59:54.4898506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4901026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4902108Z ^ 2025-05-07T19:59:54.4902429Z 2025-05-07T19:59:54.4903973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4906537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4907695Z ^ 2025-05-07T19:59:54.4907943Z 2025-05-07T19:59:54.4908402Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:54.4909019Z 2025-05-07T19:59:54.4910492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4912991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4914171Z ^ 2025-05-07T19:59:54.4914530Z 2025-05-07T19:59:54.4916128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4918568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4919593Z ^ 2025-05-07T19:59:54.4919856Z 2025-05-07T19:59:54.4920279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:54.4920882Z 2025-05-07T19:59:54.4922518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4925098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4926184Z ^ 2025-05-07T19:59:54.4926503Z 2025-05-07T19:59:54.4928032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4931043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4932237Z ^ 2025-05-07T19:59:54.4932501Z 2025-05-07T19:59:54.4932973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:54.4933592Z 2025-05-07T19:59:54.4935183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4937966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4939140Z ^ 2025-05-07T19:59:54.4939552Z 2025-05-07T19:59:54.4941255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4943673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4944721Z ^ 2025-05-07T19:59:54.4944965Z 2025-05-07T19:59:54.4945399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:54.4945997Z 2025-05-07T19:59:54.4947439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:54.4949916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:54.4951026Z ^ 2025-05-07T19:59:54.4951383Z 2025-05-07T20:00:13.8910806Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:13.8933033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8935774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8937322Z ^ 2025-05-07T20:00:13.8937579Z 2025-05-07T20:00:13.8938027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.8938705Z 2025-05-07T20:00:13.8940340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8943015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8944178Z ^ 2025-05-07T20:00:13.8944552Z 2025-05-07T20:00:13.8946067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8948675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8949807Z ^ 2025-05-07T20:00:13.8950077Z 
2025-05-07T20:00:13.8950481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.8951107Z 2025-05-07T20:00:13.8952673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8955225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8956361Z ^ 2025-05-07T20:00:13.8956703Z 2025-05-07T20:00:13.8958282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8960872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8962029Z ^ 2025-05-07T20:00:13.8962280Z 2025-05-07T20:00:13.8962722Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.8963390Z 2025-05-07T20:00:13.8964948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8967576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8968947Z ^ 2025-05-07T20:00:13.8969328Z 2025-05-07T20:00:13.8971182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8973681Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8974794Z ^ 2025-05-07T20:00:13.8975342Z 2025-05-07T20:00:13.8975681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.8976161Z 2025-05-07T20:00:13.8977478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8979805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8980902Z ^ 2025-05-07T20:00:13.8981237Z 2025-05-07T20:00:13.8982791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8985196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8986304Z ^ 2025-05-07T20:00:13.8986543Z 2025-05-07T20:00:13.8986930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.8987540Z 2025-05-07T20:00:13.8989104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.8991652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.8992767Z ^ 2025-05-07T20:00:13.8993113Z 2025-05-07T20:00:17.8822356Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:17.8845965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8848583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8849707Z ^ 2025-05-07T20:00:17.8849928Z 2025-05-07T20:00:17.8850351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:17.8850964Z 2025-05-07T20:00:17.8852548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8855162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8856528Z ^ 2025-05-07T20:00:17.8856914Z 2025-05-07T20:00:17.8858535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8861185Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8862342Z ^ 2025-05-07T20:00:17.8862608Z 2025-05-07T20:00:17.8863051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:17.8863712Z 2025-05-07T20:00:17.8865293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8867982Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8869202Z ^ 2025-05-07T20:00:17.8869575Z 2025-05-07T20:00:17.8871479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8874208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8875338Z ^ 2025-05-07T20:00:17.8875566Z 2025-05-07T20:00:17.8875979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:17.8876623Z 2025-05-07T20:00:17.8878245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8881171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8882307Z ^ 2025-05-07T20:00:17.8882663Z 2025-05-07T20:00:17.8884352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8887017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8888190Z ^ 2025-05-07T20:00:17.8888442Z 2025-05-07T20:00:17.8888831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:17.8889467Z 2025-05-07T20:00:17.8891283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8893974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8895133Z ^ 2025-05-07T20:00:17.8895499Z 2025-05-07T20:00:17.8897282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8899810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:17.8900936Z ^ 2025-05-07T20:00:17.8901179Z 2025-05-07T20:00:17.8901579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:17.8902232Z 2025-05-07T20:00:17.8903855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.8906510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:17.8907714Z ^ 2025-05-07T20:00:17.8908087Z 2025-05-07T20:00:19.7017386Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:19.7038603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7041105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7042177Z ^ 2025-05-07T20:00:19.7042420Z 2025-05-07T20:00:19.7042807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:19.7043391Z 2025-05-07T20:00:19.7044950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7047305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7048444Z ^ 2025-05-07T20:00:19.7048760Z 2025-05-07T20:00:19.7050246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:00:19.7052681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7053678Z ^ 2025-05-07T20:00:19.7053909Z 2025-05-07T20:00:19.7054326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:19.7054951Z 2025-05-07T20:00:19.7056653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7059124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7060147Z ^ 2025-05-07T20:00:19.7060522Z 2025-05-07T20:00:19.7061969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7064341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7065630Z ^ 2025-05-07T20:00:19.7065884Z 2025-05-07T20:00:19.7066302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:19.7066894Z 2025-05-07T20:00:19.7068386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7071232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7072294Z ^ 2025-05-07T20:00:19.7072607Z 2025-05-07T20:00:19.7074030Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7076579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7077666Z ^ 2025-05-07T20:00:19.7077897Z 2025-05-07T20:00:19.7078336Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:19.7078949Z 2025-05-07T20:00:19.7080335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7082799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7083894Z ^ 2025-05-07T20:00:19.7084235Z 2025-05-07T20:00:19.7085724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7088149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7089151Z ^ 2025-05-07T20:00:19.7089391Z 2025-05-07T20:00:19.7089787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:19.7090413Z 2025-05-07T20:00:19.7091914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7094335Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:19.7095374Z ^ 2025-05-07T20:00:19.7095724Z 2025-05-07T20:00:22.4266088Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:22.4278037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4279373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4279990Z ^ 2025-05-07T20:00:22.4280130Z 2025-05-07T20:00:22.4280365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:22.4280722Z 2025-05-07T20:00:22.4281551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4282906Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4283510Z ^ 2025-05-07T20:00:22.4283715Z 2025-05-07T20:00:22.4284534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4285883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4286472Z ^ 2025-05-07T20:00:22.4286626Z 2025-05-07T20:00:22.4286860Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:22.4287200Z 2025-05-07T20:00:22.4288100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4289434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4290048Z ^ 2025-05-07T20:00:22.4290236Z 2025-05-07T20:00:22.4291071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4292596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4293203Z ^ 2025-05-07T20:00:22.4293342Z 2025-05-07T20:00:22.4293574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:22.4293935Z 2025-05-07T20:00:22.4294821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4296174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4296906Z ^ 2025-05-07T20:00:22.4297113Z 2025-05-07T20:00:22.4299905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4301298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4301884Z ^ 2025-05-07T20:00:22.4302034Z 2025-05-07T20:00:22.4302267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:22.4302605Z 2025-05-07T20:00:22.4303430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4304783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4305399Z ^ 2025-05-07T20:00:22.4305592Z 2025-05-07T20:00:22.4306410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4307741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4308341Z ^ 2025-05-07T20:00:22.4308479Z 2025-05-07T20:00:22.4308713Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:00:22.4309068Z 2025-05-07T20:00:22.4309897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.4311255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:22.4311853Z ^ 2025-05-07T20:00:22.4312048Z 2025-05-07T20:00:23.9268775Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:23.9291340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9294096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9295277Z ^ 2025-05-07T20:00:23.9295530Z 2025-05-07T20:00:23.9295975Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:23.9296773Z 2025-05-07T20:00:23.9298408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9300955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9301864Z ^ 2025-05-07T20:00:23.9302187Z 2025-05-07T20:00:23.9303566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9305835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9306817Z ^ 2025-05-07T20:00:23.9307044Z 2025-05-07T20:00:23.9307431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:23.9308010Z 2025-05-07T20:00:23.9309548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9312316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9313508Z ^ 2025-05-07T20:00:23.9313865Z 2025-05-07T20:00:23.9315476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9318269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9319424Z ^ 2025-05-07T20:00:23.9319689Z 2025-05-07T20:00:23.9320130Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:00:23.9320802Z 2025-05-07T20:00:23.9322533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9325125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9326302Z ^ 2025-05-07T20:00:23.9326652Z 2025-05-07T20:00:23.9328259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9330848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9331972Z ^ 2025-05-07T20:00:23.9332208Z 2025-05-07T20:00:23.9332634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:23.9333311Z 2025-05-07T20:00:23.9334966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9337697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9338837Z ^ 2025-05-07T20:00:23.9339197Z 2025-05-07T20:00:23.9340868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9343485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9344621Z ^ 2025-05-07T20:00:23.9344882Z 2025-05-07T20:00:23.9345307Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:23.9345958Z 2025-05-07T20:00:23.9347571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.9350123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:23.9351237Z ^ 2025-05-07T20:00:23.9351565Z 2025-05-07T20:00:24.7432479Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:24.7450749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7452162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7452884Z ^ 2025-05-07T20:00:24.7453227Z 2025-05-07T20:00:24.7453675Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:24.7454249Z 2025-05-07T20:00:24.7455506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7458018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7459237Z ^ 2025-05-07T20:00:24.7459646Z 2025-05-07T20:00:24.7461272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7463596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7464622Z ^ 2025-05-07T20:00:24.7464882Z 2025-05-07T20:00:24.7465333Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.7466158Z 2025-05-07T20:00:24.7467469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7469768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7471165Z ^ 2025-05-07T20:00:24.7471473Z 2025-05-07T20:00:24.7473049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7475704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7476800Z ^ 2025-05-07T20:00:24.7477160Z 2025-05-07T20:00:24.7477593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.7478242Z 2025-05-07T20:00:24.7479886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7482553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7483585Z ^ 2025-05-07T20:00:24.7483882Z 2025-05-07T20:00:24.7485179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7487526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7488582Z ^ 2025-05-07T20:00:24.7488859Z 2025-05-07T20:00:24.7489341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.7489949Z 2025-05-07T20:00:24.7491430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7494231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7495410Z ^ 2025-05-07T20:00:24.7495803Z 2025-05-07T20:00:24.7497650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7500173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7501305Z ^ 2025-05-07T20:00:24.7501588Z 2025-05-07T20:00:24.7502015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.7502615Z 2025-05-07T20:00:24.7504269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7506722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.7507806Z ^ 2025-05-07T20:00:24.7508409Z 2025-05-07T20:00:31.1043688Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:31.1065903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1068655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:31.1069845Z ^ 2025-05-07T20:00:31.1070407Z 2025-05-07T20:00:31.1070853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:31.1071483Z 2025-05-07T20:00:31.1073023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1075561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1076697Z ^ 2025-05-07T20:00:31.1077057Z 2025-05-07T20:00:31.1078641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1081640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1082815Z ^ 2025-05-07T20:00:31.1083073Z 2025-05-07T20:00:31.1083478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:31.1084132Z 2025-05-07T20:00:31.1086837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1089523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1090640Z ^ 2025-05-07T20:00:31.1091005Z 2025-05-07T20:00:31.1092688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:31.1094909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1095862Z ^ 2025-05-07T20:00:31.1096081Z 2025-05-07T20:00:31.1096632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:31.1097152Z 2025-05-07T20:00:31.1098443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1100502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1101439Z ^ 2025-05-07T20:00:31.1101737Z 2025-05-07T20:00:31.1103122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1105403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1106433Z ^ 2025-05-07T20:00:31.1106650Z 2025-05-07T20:00:31.1107038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:31.1107634Z 2025-05-07T20:00:31.1109029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1111331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1112353Z ^ 2025-05-07T20:00:31.1112688Z 2025-05-07T20:00:31.1114116Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1116577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1117627Z ^ 2025-05-07T20:00:31.1117853Z 2025-05-07T20:00:31.1118292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:31.1118872Z 2025-05-07T20:00:31.1120359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:31.1122966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:31.1124007Z ^ 2025-05-07T20:00:31.1124316Z 2025-05-07T20:00:32.0819079Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:00:32.0840096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:00:32.0842654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0843767Z ^ 2025-05-07T20:00:32.0844064Z 2025-05-07T20:00:32.0844527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.0845198Z 2025-05-07T20:00:32.0846853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0849339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0850823Z ^ 2025-05-07T20:00:32.0851282Z 2025-05-07T20:00:32.0852978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0855593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0856906Z ^ 2025-05-07T20:00:32.0857157Z 2025-05-07T20:00:32.0857712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.0858380Z 2025-05-07T20:00:32.0860014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0862688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0863720Z ^ 2025-05-07T20:00:32.0864038Z 2025-05-07T20:00:32.0865437Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0867775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0868811Z ^ 2025-05-07T20:00:32.0869021Z 2025-05-07T20:00:32.0869391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.0869962Z 2025-05-07T20:00:32.0871927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0874363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0875409Z ^ 2025-05-07T20:00:32.0875751Z 2025-05-07T20:00:32.0877270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0879872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0881059Z ^ 2025-05-07T20:00:32.0881320Z 2025-05-07T20:00:32.0881803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.0882468Z 2025-05-07T20:00:32.0884127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0886820Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0887996Z ^ 2025-05-07T20:00:32.0888387Z 2025-05-07T20:00:32.0890026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0892700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0893847Z ^ 2025-05-07T20:00:32.0894474Z 2025-05-07T20:00:32.0894922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.0895594Z 2025-05-07T20:00:32.0897432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.0900083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.0901376Z ^ 2025-05-07T20:00:32.0901742Z 2025-05-07T20:00:34.4926438Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:34.4949309Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4952089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4953276Z ^ 2025-05-07T20:00:34.4953553Z 2025-05-07T20:00:34.4954013Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.4954682Z 2025-05-07T20:00:34.4956363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4959422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4960638Z ^ 2025-05-07T20:00:34.4961018Z 2025-05-07T20:00:34.4962757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4965450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4966648Z ^ 2025-05-07T20:00:34.4966909Z 2025-05-07T20:00:34.4967359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.4968071Z 2025-05-07T20:00:34.4969841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4972835Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4974021Z ^ 2025-05-07T20:00:34.4974391Z 2025-05-07T20:00:34.4976080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4978822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4980020Z ^ 2025-05-07T20:00:34.4980286Z 2025-05-07T20:00:34.4980775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.4981451Z 2025-05-07T20:00:34.4983096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4985778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4987000Z ^ 2025-05-07T20:00:34.4987374Z 2025-05-07T20:00:34.4989020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4991724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4992840Z ^ 2025-05-07T20:00:34.4993111Z 2025-05-07T20:00:34.4993525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.4994144Z 2025-05-07T20:00:34.4995534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.4997743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.4998868Z ^ 2025-05-07T20:00:34.4999211Z 2025-05-07T20:00:34.5000686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.5005690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.5006804Z ^ 2025-05-07T20:00:34.5007047Z 2025-05-07T20:00:34.5007470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.5008081Z 2025-05-07T20:00:34.5009799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.5012314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.5013421Z ^ 2025-05-07T20:00:34.5013791Z 2025-05-07T20:00:36.4228668Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:00:36.4251587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4254259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4255370Z ^ 2025-05-07T20:00:36.4255924Z 2025-05-07T20:00:36.4256625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.4257261Z 2025-05-07T20:00:36.4258848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4261405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4262715Z ^ 2025-05-07T20:00:36.4263061Z 2025-05-07T20:00:36.4264639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4267299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4268586Z ^ 2025-05-07T20:00:36.4268846Z 2025-05-07T20:00:36.4269302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.4269950Z 2025-05-07T20:00:36.4271792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4274372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4275477Z ^ 2025-05-07T20:00:36.4275783Z 2025-05-07T20:00:36.4277337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4279929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4281084Z ^ 2025-05-07T20:00:36.4281359Z 2025-05-07T20:00:36.4281791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.4282449Z 2025-05-07T20:00:36.4284126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4286788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4287977Z ^ 2025-05-07T20:00:36.4288343Z 2025-05-07T20:00:36.4289998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4292583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4293712Z ^ 2025-05-07T20:00:36.4293947Z 2025-05-07T20:00:36.4294382Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:36.4295059Z 2025-05-07T20:00:36.4296803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4299503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4301002Z ^ 2025-05-07T20:00:36.4301391Z 2025-05-07T20:00:36.4302965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4305470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4306655Z ^ 2025-05-07T20:00:36.4306921Z 2025-05-07T20:00:36.4307545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.4308216Z 2025-05-07T20:00:36.4309930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.4312759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.4313975Z ^ 2025-05-07T20:00:36.4314340Z 2025-05-07T20:00:41.5836420Z [335/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:00:41.5855845Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.0443316Z [336/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:00:44.0463224Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.8198180Z [337/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:45.8216277Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:49.7844056Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:00:49.7866855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7869546Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7870938Z ^ 2025-05-07T20:00:49.7871188Z 2025-05-07T20:00:49.7871625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.7872318Z 2025-05-07T20:00:49.7873949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7876509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7878043Z ^ 2025-05-07T20:00:49.7878443Z 2025-05-07T20:00:49.7880071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7882680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7883810Z ^ 2025-05-07T20:00:49.7884177Z 2025-05-07T20:00:49.7884650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.7885303Z 2025-05-07T20:00:49.7886904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7889586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7890763Z ^ 2025-05-07T20:00:49.7891122Z 2025-05-07T20:00:49.7892736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7895241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7896342Z ^ 2025-05-07T20:00:49.7896725Z 2025-05-07T20:00:49.7897152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.7897812Z 2025-05-07T20:00:49.7899475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7902105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7903315Z ^ 2025-05-07T20:00:49.7903643Z 2025-05-07T20:00:49.7905200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7907684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7908802Z ^ 2025-05-07T20:00:49.7909047Z 2025-05-07T20:00:49.7909486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.7910136Z 2025-05-07T20:00:49.7911612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7913985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:00:49.7915058Z ^ 2025-05-07T20:00:49.7915410Z 2025-05-07T20:00:49.7916952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7919511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7920868Z ^ 2025-05-07T20:00:49.7921142Z 2025-05-07T20:00:49.7921590Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:49.7922250Z 2025-05-07T20:00:49.7923871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:49.7926535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:49.7927720Z ^ 2025-05-07T20:00:49.7928061Z 2025-05-07T20:00:50.9032769Z [339/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:00:50.9053283Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.4913631Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:00:51.4937182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4939863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4940991Z ^ 2025-05-07T20:00:51.4941259Z 2025-05-07T20:00:51.4941690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.4942294Z 2025-05-07T20:00:51.4943986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4946634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4947829Z ^ 2025-05-07T20:00:51.4948155Z 2025-05-07T20:00:51.4949809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4952547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4953747Z ^ 2025-05-07T20:00:51.4953999Z 2025-05-07T20:00:51.4954450Z Remark: The warnings 
can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.4955142Z 2025-05-07T20:00:51.4956841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4959421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4960602Z ^ 2025-05-07T20:00:51.4961282Z 2025-05-07T20:00:51.4962942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4965425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4966548Z ^ 2025-05-07T20:00:51.4966815Z 2025-05-07T20:00:51.4967267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.4968033Z 2025-05-07T20:00:51.4969676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4972634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4974008Z ^ 2025-05-07T20:00:51.4974339Z 2025-05-07T20:00:51.4975889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4978608Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4979770Z ^ 2025-05-07T20:00:51.4980013Z 2025-05-07T20:00:51.4980459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.4981139Z 2025-05-07T20:00:51.4982803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4985514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4986693Z ^ 2025-05-07T20:00:51.4987053Z 2025-05-07T20:00:51.4988715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4991273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4992421Z ^ 2025-05-07T20:00:51.4992669Z 2025-05-07T20:00:51.4993111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.4993716Z 2025-05-07T20:00:51.4995317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.4998021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.4999225Z ^ 2025-05-07T20:00:51.4999596Z 2025-05-07T20:00:52.2974020Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:00:52.2996100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.2998759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.2999822Z ^ 2025-05-07T20:00:52.3000066Z 2025-05-07T20:00:52.3000495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:52.3001152Z 2025-05-07T20:00:52.3002744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3005259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3006209Z ^ 2025-05-07T20:00:52.3006496Z 2025-05-07T20:00:52.3007760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3010110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:52.3011245Z ^ 2025-05-07T20:00:52.3011490Z 2025-05-07T20:00:52.3011925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:52.3012553Z 2025-05-07T20:00:52.3014155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3017149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3018265Z ^ 2025-05-07T20:00:52.3018627Z 2025-05-07T20:00:52.3020184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3022873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3023999Z ^ 2025-05-07T20:00:52.3024262Z 2025-05-07T20:00:52.3024699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:52.3025350Z 2025-05-07T20:00:52.3027075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3029676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3030834Z ^ 2025-05-07T20:00:52.3031195Z 2025-05-07T20:00:52.3032599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:52.3034474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3035276Z ^ 2025-05-07T20:00:52.3035451Z 2025-05-07T20:00:52.3035780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:52.3036311Z 2025-05-07T20:00:52.3037487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3039421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3040297Z ^ 2025-05-07T20:00:52.3040582Z 2025-05-07T20:00:52.3041773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3043781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3044702Z ^ 2025-05-07T20:00:52.3044912Z 2025-05-07T20:00:52.3045255Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:52.3045759Z 2025-05-07T20:00:52.3047045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.3048797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:52.3049739Z ^ 2025-05-07T20:00:52.3050031Z 2025-05-07T20:00:53.3591214Z [342/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:00:53.3611596Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.3563223Z [343/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:55.3587599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3590161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3591045Z ^ 2025-05-07T20:00:55.3591243Z 2025-05-07T20:00:55.3591586Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:55.3592095Z 2025-05-07T20:00:55.3593331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3595382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3596380Z ^ 2025-05-07T20:00:55.3596703Z 2025-05-07T20:00:55.3598176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3600580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3601629Z ^ 2025-05-07T20:00:55.3601848Z 2025-05-07T20:00:55.3602252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:55.3602867Z 2025-05-07T20:00:55.3604427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3606932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3608011Z ^ 2025-05-07T20:00:55.3608325Z 2025-05-07T20:00:55.3609970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3612690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3613875Z ^ 2025-05-07T20:00:55.3614124Z 2025-05-07T20:00:55.3614575Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:55.3615269Z 2025-05-07T20:00:55.3617078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3620166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3621353Z ^ 2025-05-07T20:00:55.3621723Z 2025-05-07T20:00:55.3623371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3626145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3627318Z ^ 2025-05-07T20:00:55.3627565Z 2025-05-07T20:00:55.3628028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:55.3628702Z 2025-05-07T20:00:55.3630456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3633196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3634324Z ^ 2025-05-07T20:00:55.3634648Z 2025-05-07T20:00:55.3636246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3638836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3639991Z ^ 2025-05-07T20:00:55.3640235Z 2025-05-07T20:00:55.3640661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:55.3641346Z 2025-05-07T20:00:55.3643010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.3645664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:55.3646831Z ^ 2025-05-07T20:00:55.3647192Z 2025-05-07T20:00:55.6072649Z [344/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:55.6092437Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.2986506Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:00:57.3009642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3012400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3013903Z ^ 2025-05-07T20:00:57.3014161Z 2025-05-07T20:00:57.3014609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:57.3015277Z 2025-05-07T20:00:57.3017051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3019846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3021014Z ^ 2025-05-07T20:00:57.3021379Z 2025-05-07T20:00:57.3023009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3025799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3026894Z ^ 2025-05-07T20:00:57.3027159Z 2025-05-07T20:00:57.3027578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:57.3028245Z 2025-05-07T20:00:57.3029893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3032341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3033487Z ^ 2025-05-07T20:00:57.3033837Z 2025-05-07T20:00:57.3035457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3038057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3039193Z ^ 2025-05-07T20:00:57.3039439Z 2025-05-07T20:00:57.3039896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:57.3040532Z 2025-05-07T20:00:57.3042102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3044649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3045846Z ^ 2025-05-07T20:00:57.3046239Z 2025-05-07T20:00:57.3047837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3050465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3051609Z ^ 2025-05-07T20:00:57.3051872Z 2025-05-07T20:00:57.3052326Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:57.3052984Z 2025-05-07T20:00:57.3054653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3057549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3058660Z ^ 2025-05-07T20:00:57.3058976Z 2025-05-07T20:00:57.3060565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3063210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3064364Z ^ 2025-05-07T20:00:57.3064605Z 2025-05-07T20:00:57.3065051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:57.3065721Z 2025-05-07T20:00:57.3067453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.3070067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:57.3071396Z ^ 2025-05-07T20:00:57.3071763Z 2025-05-07T20:00:59.0267350Z [346/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:00:59.0287227Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T20:00:59.9371470Z [347/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:59.9391185Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:00.6459384Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:00.6479201Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:01.4295959Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:02.0308914Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:02.0330949Z [350/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:02.0354792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0357405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0358570Z ^ 2025-05-07T20:01:02.0358827Z 2025-05-07T20:01:02.0359264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:02.0359905Z 2025-05-07T20:01:02.0361537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0364145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0365312Z ^ 2025-05-07T20:01:02.0365663Z 2025-05-07T20:01:02.0367120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0369446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0370920Z ^ 2025-05-07T20:01:02.0371169Z 2025-05-07T20:01:02.0371598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:02.0372233Z 2025-05-07T20:01:02.0373517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0375864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0377139Z ^ 2025-05-07T20:01:02.0377502Z 2025-05-07T20:01:02.0379112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0382059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0383222Z ^ 2025-05-07T20:01:02.0383465Z 2025-05-07T20:01:02.0383911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:02.0384574Z 2025-05-07T20:01:02.0386288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0388933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0390093Z ^ 2025-05-07T20:01:02.0390462Z 2025-05-07T20:01:02.0392161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0394784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0395898Z ^ 2025-05-07T20:01:02.0396150Z 2025-05-07T20:01:02.0396590Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:02.0397229Z 2025-05-07T20:01:02.0398838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0401416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0402416Z ^ 2025-05-07T20:01:02.0402771Z 2025-05-07T20:01:02.0404183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0406767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0407906Z ^ 2025-05-07T20:01:02.0408146Z 2025-05-07T20:01:02.0408579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:02.0409247Z 2025-05-07T20:01:02.0410884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0413547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:02.0414700Z ^ 2025-05-07T20:01:02.0415070Z 2025-05-07T20:01:02.6061218Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:02.6080920Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T20:01:04.6688487Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:04.6707700Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.0358495Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:07.0381774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0384367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0385539Z ^ 2025-05-07T20:01:07.0385814Z 2025-05-07T20:01:07.0386274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.0386935Z 2025-05-07T20:01:07.0388579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:01:07.0391149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0392192Z ^ 2025-05-07T20:01:07.0392481Z 2025-05-07T20:01:07.0393922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0396340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0397837Z ^ 2025-05-07T20:01:07.0398099Z 2025-05-07T20:01:07.0398522Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.0399157Z 2025-05-07T20:01:07.0400792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0403468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0404625Z ^ 2025-05-07T20:01:07.0404986Z 2025-05-07T20:01:07.0406551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0409184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0410313Z ^ 2025-05-07T20:01:07.0410545Z 2025-05-07T20:01:07.0410927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.0411577Z 2025-05-07T20:01:07.0413196Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0415854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0417058Z ^ 2025-05-07T20:01:07.0417408Z 2025-05-07T20:01:07.0418911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0421468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0422591Z ^ 2025-05-07T20:01:07.0422845Z 2025-05-07T20:01:07.0423287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.0423934Z 2025-05-07T20:01:07.0425520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0439917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0441173Z ^ 2025-05-07T20:01:07.0441542Z 2025-05-07T20:01:07.0443163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0445790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0446959Z ^ 
2025-05-07T20:01:07.0447208Z 2025-05-07T20:01:07.0447665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.0448333Z 2025-05-07T20:01:07.0449788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.0452693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.0453878Z ^ 2025-05-07T20:01:07.0454231Z 2025-05-07T20:01:07.5875353Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:07.5896479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5899280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5900351Z ^ 2025-05-07T20:01:07.5900591Z 2025-05-07T20:01:07.5901000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.5901571Z 2025-05-07T20:01:07.5903169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5905717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5906933Z ^ 2025-05-07T20:01:07.5907305Z 2025-05-07T20:01:07.5908934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5911822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5912974Z ^ 2025-05-07T20:01:07.5913228Z 2025-05-07T20:01:07.5913679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.5914334Z 2025-05-07T20:01:07.5916071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5918763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5919911Z ^ 2025-05-07T20:01:07.5920281Z 2025-05-07T20:01:07.5921978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5924600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5925788Z ^ 2025-05-07T20:01:07.5926057Z 2025-05-07T20:01:07.5926513Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:01:07.5927179Z 2025-05-07T20:01:07.5928785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5931220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5932284Z ^ 2025-05-07T20:01:07.5932626Z 2025-05-07T20:01:07.5934189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5936855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5937930Z ^ 2025-05-07T20:01:07.5938167Z 2025-05-07T20:01:07.5938611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.5939278Z 2025-05-07T20:01:07.5940767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5943211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5944372Z ^ 2025-05-07T20:01:07.5944723Z 2025-05-07T20:01:07.5946315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5948929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5950070Z ^ 2025-05-07T20:01:07.5950310Z 2025-05-07T20:01:07.5950756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.5951608Z 2025-05-07T20:01:07.5953252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5955813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.5956905Z ^ 2025-05-07T20:01:07.5957257Z 2025-05-07T20:01:12.0848964Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:12.0872184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0874628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0875727Z ^ 2025-05-07T20:01:12.0875955Z 
2025-05-07T20:01:12.0876362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.0876991Z 2025-05-07T20:01:12.0878447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0881231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0882294Z ^ 2025-05-07T20:01:12.0882659Z 2025-05-07T20:01:12.0884258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0887064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0888166Z ^ 2025-05-07T20:01:12.0888419Z 2025-05-07T20:01:12.0888852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.0889477Z 2025-05-07T20:01:12.0891228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0893889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0895078Z ^ 2025-05-07T20:01:12.0895433Z 2025-05-07T20:01:12.0897065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0899444Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0900522Z ^ 2025-05-07T20:01:12.0900769Z 2025-05-07T20:01:12.0901227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.0901896Z 2025-05-07T20:01:12.0903478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0906058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0907180Z ^ 2025-05-07T20:01:12.0907559Z 2025-05-07T20:01:12.0909139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0911664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0912794Z ^ 2025-05-07T20:01:12.0913043Z 2025-05-07T20:01:12.0913481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.0914122Z 2025-05-07T20:01:12.0915706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0918194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0919319Z ^ 2025-05-07T20:01:12.0919672Z 2025-05-07T20:01:12.0921180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0923913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0925018Z ^ 2025-05-07T20:01:12.0925262Z 2025-05-07T20:01:12.0925664Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:12.0926301Z 2025-05-07T20:01:12.0928009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.0930622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:12.0931767Z ^ 2025-05-07T20:01:12.0932111Z 2025-05-07T20:01:14.1613461Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:01:14.1634468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:14.1637035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1638052Z ^ 2025-05-07T20:01:14.1638687Z 2025-05-07T20:01:14.1639125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.1639670Z 2025-05-07T20:01:14.1640891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1643015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1644246Z ^ 2025-05-07T20:01:14.1644612Z 2025-05-07T20:01:14.1645931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1648167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1649302Z ^ 2025-05-07T20:01:14.1649555Z 2025-05-07T20:01:14.1649962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.1650488Z 2025-05-07T20:01:14.1651974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1654322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1655416Z ^ 2025-05-07T20:01:14.1655761Z 2025-05-07T20:01:14.1657480Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1659974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1661042Z ^ 2025-05-07T20:01:14.1661274Z 2025-05-07T20:01:14.1661661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.1662211Z 2025-05-07T20:01:14.1663640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1666086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1667197Z ^ 2025-05-07T20:01:14.1667564Z 2025-05-07T20:01:14.1669100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1671820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1672886Z ^ 2025-05-07T20:01:14.1673127Z 2025-05-07T20:01:14.1673552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.1674128Z 2025-05-07T20:01:14.1675540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1677816Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1679187Z ^ 2025-05-07T20:01:14.1679518Z 2025-05-07T20:01:14.1680920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1683336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1684369Z ^ 2025-05-07T20:01:14.1684616Z 2025-05-07T20:01:14.1685178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.1685789Z 2025-05-07T20:01:14.1687281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.1689799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.1690896Z ^ 2025-05-07T20:01:14.1691238Z 2025-05-07T20:01:17.3146338Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:17.3168338Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3171607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3172696Z ^ 2025-05-07T20:01:17.3172954Z 2025-05-07T20:01:17.3173366Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.3173988Z 2025-05-07T20:01:17.3175743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3178496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3179630Z ^ 2025-05-07T20:01:17.3179967Z 2025-05-07T20:01:17.3181456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3183821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3184817Z ^ 2025-05-07T20:01:17.3185028Z 2025-05-07T20:01:17.3185416Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.3186013Z 2025-05-07T20:01:17.3187502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3189772Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3190875Z ^ 2025-05-07T20:01:17.3191231Z 2025-05-07T20:01:17.3192763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3195102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3196159Z ^ 2025-05-07T20:01:17.3196409Z 2025-05-07T20:01:17.3196799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.3197354Z 2025-05-07T20:01:17.3198776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3201251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3202374Z ^ 2025-05-07T20:01:17.3202728Z 2025-05-07T20:01:17.3204210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3206597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3207679Z ^ 2025-05-07T20:01:17.3207896Z 2025-05-07T20:01:17.3208277Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.3208886Z 2025-05-07T20:01:17.3210614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3214505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3215502Z ^ 2025-05-07T20:01:17.3215812Z 2025-05-07T20:01:17.3217541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3219946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3221038Z ^ 2025-05-07T20:01:17.3221273Z 2025-05-07T20:01:17.3221726Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.3222438Z 2025-05-07T20:01:17.3223986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.3226476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.3227546Z ^ 2025-05-07T20:01:17.3227853Z 2025-05-07T20:01:18.5874931Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:01:18.5897581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5900466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5901526Z ^ 2025-05-07T20:01:18.5901788Z 2025-05-07T20:01:18.5902197Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5902816Z 2025-05-07T20:01:18.5904588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5907176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5908219Z ^ 2025-05-07T20:01:18.5908568Z 2025-05-07T20:01:18.5910181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5912762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5913831Z ^ 2025-05-07T20:01:18.5914052Z 2025-05-07T20:01:18.5914470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5915127Z 2025-05-07T20:01:18.5916741Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5919169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5920303Z ^ 2025-05-07T20:01:18.5920655Z 2025-05-07T20:01:18.5922288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5924723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5925886Z ^ 2025-05-07T20:01:18.5926157Z 2025-05-07T20:01:18.5926571Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5927143Z 2025-05-07T20:01:18.5928663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5931174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5932378Z ^ 2025-05-07T20:01:18.5932746Z 2025-05-07T20:01:18.5934233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5936986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5938319Z ^ 
2025-05-07T20:01:18.5938561Z 2025-05-07T20:01:18.5938993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5939631Z 2025-05-07T20:01:18.5941282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5943919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5945130Z ^ 2025-05-07T20:01:18.5945470Z 2025-05-07T20:01:18.5946948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5949500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5950631Z ^ 2025-05-07T20:01:18.5950883Z 2025-05-07T20:01:18.5951315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5951975Z 2025-05-07T20:01:18.5953564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5956130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5957277Z ^ 2025-05-07T20:01:18.5957597Z 2025-05-07T20:01:18.7273410Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:18.7296181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7298870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7300021Z ^ 2025-05-07T20:01:18.7300440Z 2025-05-07T20:01:18.7300867Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.7301508Z 2025-05-07T20:01:18.7303019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7305580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7306745Z ^ 2025-05-07T20:01:18.7307032Z 2025-05-07T20:01:18.7308516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7311033Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7312117Z ^ 2025-05-07T20:01:18.7312344Z 2025-05-07T20:01:18.7312767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.7313393Z 2025-05-07T20:01:18.7314982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7317408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7318522Z ^ 2025-05-07T20:01:18.7318884Z 2025-05-07T20:01:18.7320483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7322941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7324028Z ^ 2025-05-07T20:01:18.7324274Z 2025-05-07T20:01:18.7324707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.7325331Z 2025-05-07T20:01:18.7326900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7329232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7330370Z ^ 2025-05-07T20:01:18.7330945Z 2025-05-07T20:01:18.7332275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7334815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7335958Z ^ 2025-05-07T20:01:18.7336235Z 2025-05-07T20:01:18.7336962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.7337482Z 2025-05-07T20:01:18.7338919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7341677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7342751Z ^ 2025-05-07T20:01:18.7343078Z 2025-05-07T20:01:18.7344651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7347125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7348210Z ^ 2025-05-07T20:01:18.7348466Z 2025-05-07T20:01:18.7348927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.7349506Z 2025-05-07T20:01:18.7351071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.7353588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.7354732Z ^ 
2025-05-07T20:01:18.7355070Z 2025-05-07T20:01:20.0393174Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:20.0414532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0416956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0417985Z ^ 2025-05-07T20:01:20.0418221Z 2025-05-07T20:01:20.0418631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.0419230Z 2025-05-07T20:01:20.0420740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0423064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0424099Z ^ 2025-05-07T20:01:20.0424440Z 2025-05-07T20:01:20.0425919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0428324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0429371Z ^ 2025-05-07T20:01:20.0429633Z 2025-05-07T20:01:20.0430073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.0430647Z 2025-05-07T20:01:20.0432049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0434392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0435517Z ^ 2025-05-07T20:01:20.0435875Z 2025-05-07T20:01:20.0437378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0439727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0440798Z ^ 2025-05-07T20:01:20.0441024Z 2025-05-07T20:01:20.0441321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.0441771Z 2025-05-07T20:01:20.0443060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0445739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0446766Z ^ 
2025-05-07T20:01:20.0447048Z 2025-05-07T20:01:20.0448566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0450911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0451888Z ^ 2025-05-07T20:01:20.0452140Z 2025-05-07T20:01:20.0452540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.0453162Z 2025-05-07T20:01:20.0454729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0457354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0458445Z ^ 2025-05-07T20:01:20.0458772Z 2025-05-07T20:01:20.0460281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.0462644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0463705Z ^ 2025-05-07T20:01:20.0463944Z 2025-05-07T20:01:20.0464283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.0464892Z 2025-05-07T20:01:20.0466420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:20.0468919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.0470021Z ^ 2025-05-07T20:01:20.0470649Z 2025-05-07T20:01:21.5320325Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:21.5344698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5347626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5348875Z ^ 2025-05-07T20:01:21.5349140Z 2025-05-07T20:01:21.5349617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.5350355Z 2025-05-07T20:01:21.5351975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5354810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5356097Z ^ 2025-05-07T20:01:21.5356444Z 2025-05-07T20:01:21.5358177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5360972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5362239Z ^ 2025-05-07T20:01:21.5362532Z 2025-05-07T20:01:21.5363013Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.5363740Z 2025-05-07T20:01:21.5365556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5368445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5369747Z ^ 2025-05-07T20:01:21.5370495Z 2025-05-07T20:01:21.5372243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5375133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5376540Z ^ 2025-05-07T20:01:21.5377159Z 2025-05-07T20:01:21.5377585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.5378246Z 2025-05-07T20:01:21.5380005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5382911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5384362Z ^ 2025-05-07T20:01:21.5384800Z 2025-05-07T20:01:21.5386565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5389444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5390871Z ^ 2025-05-07T20:01:21.5391116Z 2025-05-07T20:01:21.5391631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.5392353Z 2025-05-07T20:01:21.5394127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5396991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5398290Z ^ 2025-05-07T20:01:21.5398694Z 2025-05-07T20:01:21.5400460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5403394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5404715Z ^ 2025-05-07T20:01:21.5404998Z 2025-05-07T20:01:21.5405488Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.5406252Z 2025-05-07T20:01:21.5408026Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.5410888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.5412157Z ^ 2025-05-07T20:01:21.5412560Z 2025-05-07T20:01:29.4696609Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:29.4720984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4723852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4725144Z ^ 2025-05-07T20:01:29.4725455Z 2025-05-07T20:01:29.4725964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.4726699Z 2025-05-07T20:01:29.4728338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4731219Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4732537Z ^ 2025-05-07T20:01:29.4732947Z 2025-05-07T20:01:29.4734729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4737793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4739132Z ^ 2025-05-07T20:01:29.4739420Z 2025-05-07T20:01:29.4739905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.4740663Z 2025-05-07T20:01:29.4742443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4745371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4746680Z ^ 2025-05-07T20:01:29.4747110Z 2025-05-07T20:01:29.4748894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4752076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4753347Z ^ 2025-05-07T20:01:29.4753659Z 2025-05-07T20:01:29.4754144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.4754872Z 2025-05-07T20:01:29.4756811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4759718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4761045Z ^ 2025-05-07T20:01:29.4761440Z 2025-05-07T20:01:29.4763298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4766221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4767522Z ^ 2025-05-07T20:01:29.4767806Z 2025-05-07T20:01:29.4768293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.4769042Z 2025-05-07T20:01:29.4771144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4774100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4775426Z ^ 2025-05-07T20:01:29.4775869Z 2025-05-07T20:01:29.4777760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4780671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4781966Z ^ 2025-05-07T20:01:29.4782251Z 2025-05-07T20:01:29.4782772Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:29.4783499Z 2025-05-07T20:01:29.4785280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.4788230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.4789558Z ^ 2025-05-07T20:01:29.4789956Z 2025-05-07T20:01:36.7421272Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:36.7446266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7449002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7450289Z ^ 2025-05-07T20:01:36.7450572Z 2025-05-07T20:01:36.7451054Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.7451784Z 2025-05-07T20:01:36.7453578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7456634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7457924Z ^ 2025-05-07T20:01:36.7458328Z 2025-05-07T20:01:36.7459925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7462765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7464044Z ^ 2025-05-07T20:01:36.7464316Z 2025-05-07T20:01:36.7464808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.7465529Z 2025-05-07T20:01:36.7467302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7470474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7471973Z ^ 2025-05-07T20:01:36.7472348Z 2025-05-07T20:01:36.7474108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7476972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7478221Z ^ 2025-05-07T20:01:36.7478716Z 2025-05-07T20:01:36.7479198Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:36.7479909Z 2025-05-07T20:01:36.7481531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7486704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7488062Z ^ 2025-05-07T20:01:36.7488449Z 2025-05-07T20:01:36.7490219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7492901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7494187Z ^ 2025-05-07T20:01:36.7494460Z 2025-05-07T20:01:36.7494935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.7495663Z 2025-05-07T20:01:36.7497593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7500498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7501624Z ^ 2025-05-07T20:01:36.7502031Z 2025-05-07T20:01:36.7503794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7506659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7507710Z ^ 2025-05-07T20:01:36.7507980Z 2025-05-07T20:01:36.7508452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:36.7509183Z 2025-05-07T20:01:36.7510974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.7513750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:36.7515051Z ^ 2025-05-07T20:01:36.7515442Z 2025-05-07T20:01:40.6188140Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:01:40.6214184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6217276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6218549Z ^ 2025-05-07T20:01:40.6218827Z 2025-05-07T20:01:40.6219311Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T20:01:40.6220040Z 2025-05-07T20:01:40.6221819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6224742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6226031Z ^ 2025-05-07T20:01:40.6226439Z 2025-05-07T20:01:40.6228136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6231021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6232296Z ^ 2025-05-07T20:01:40.6232574Z 2025-05-07T20:01:40.6233067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:40.6233778Z 2025-05-07T20:01:40.6235741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6238725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6240023Z ^ 2025-05-07T20:01:40.6240422Z 2025-05-07T20:01:40.6242287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6245161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6246438Z ^ 2025-05-07T20:01:40.6246722Z 2025-05-07T20:01:40.6247203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:40.6248023Z 2025-05-07T20:01:40.6249811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6252680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6253961Z ^ 2025-05-07T20:01:40.6254352Z 2025-05-07T20:01:40.6256116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6259078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6260370Z ^ 2025-05-07T20:01:40.6260644Z 2025-05-07T20:01:40.6261133Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:40.6261842Z 2025-05-07T20:01:40.6263609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6266500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6267797Z ^ 2025-05-07T20:01:40.6268183Z 2025-05-07T20:01:40.6269930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6273004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6274141Z ^ 2025-05-07T20:01:40.6274431Z 2025-05-07T20:01:40.6274907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:40.6275621Z 2025-05-07T20:01:40.6277422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6280271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.6281555Z ^ 2025-05-07T20:01:40.6281943Z 2025-05-07T20:01:45.9128679Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:45.9155098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9157991Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9159316Z ^ 2025-05-07T20:01:45.9159596Z 2025-05-07T20:01:45.9160107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.9160836Z 2025-05-07T20:01:45.9162648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9165610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9166934Z ^ 2025-05-07T20:01:45.9167329Z 2025-05-07T20:01:45.9168984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9172522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9173792Z ^ 2025-05-07T20:01:45.9174090Z 2025-05-07T20:01:45.9174574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.9175284Z 2025-05-07T20:01:45.9177413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9180284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9181574Z ^ 2025-05-07T20:01:45.9181968Z 2025-05-07T20:01:45.9183818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9186665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9187937Z ^ 2025-05-07T20:01:45.9188208Z 2025-05-07T20:01:45.9188682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.9189409Z 2025-05-07T20:01:45.9191197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9193958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9195232Z ^ 2025-05-07T20:01:45.9195646Z 2025-05-07T20:01:45.9197400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9200264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9201518Z ^ 2025-05-07T20:01:45.9201805Z 2025-05-07T20:01:45.9202282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.9202998Z 2025-05-07T20:01:45.9204789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9207659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:45.9208969Z ^ 2025-05-07T20:01:45.9209356Z 2025-05-07T20:01:45.9211102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9213971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9215266Z ^ 2025-05-07T20:01:45.9215528Z 2025-05-07T20:01:45.9216000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.9216912Z 2025-05-07T20:01:45.9218688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.9221918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.9223193Z ^ 2025-05-07T20:01:45.9223596Z 2025-05-07T20:01:51.0085788Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:51.0109611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0112224Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0113314Z ^ 2025-05-07T20:01:51.0113546Z 2025-05-07T20:01:51.0113976Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.0114618Z 2025-05-07T20:01:51.0116237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0118761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0120181Z ^ 2025-05-07T20:01:51.0120545Z 2025-05-07T20:01:51.0122074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0124121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0124981Z ^ 2025-05-07T20:01:51.0125218Z 2025-05-07T20:01:51.0125678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.0126174Z 2025-05-07T20:01:51.0127498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0129623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:51.0130760Z ^ 2025-05-07T20:01:51.0131027Z 2025-05-07T20:01:51.0132302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0134402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0135334Z ^ 2025-05-07T20:01:51.0135531Z 2025-05-07T20:01:51.0135889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.0136558Z 2025-05-07T20:01:51.0137868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0140035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0140957Z ^ 2025-05-07T20:01:51.0141224Z 2025-05-07T20:01:51.0142492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0144757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0145764Z ^ 2025-05-07T20:01:51.0145998Z 2025-05-07T20:01:51.0146410Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.0147039Z 2025-05-07T20:01:51.0148582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:51.0150939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0151992Z ^ 2025-05-07T20:01:51.0152303Z 2025-05-07T20:01:51.0153731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0156100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0157330Z ^ 2025-05-07T20:01:51.0157610Z 2025-05-07T20:01:51.0158007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.0158616Z 2025-05-07T20:01:51.0160074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.0162612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.0163837Z ^ 2025-05-07T20:01:51.0164185Z 2025-05-07T20:01:51.7687619Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:51.7709049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7711634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7712635Z ^ 2025-05-07T20:01:51.7712832Z 2025-05-07T20:01:51.7713202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.7713771Z 2025-05-07T20:01:51.7715527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7718033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7719072Z ^ 2025-05-07T20:01:51.7719387Z 2025-05-07T20:01:51.7720973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7723358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7724377Z ^ 2025-05-07T20:01:51.7724596Z 2025-05-07T20:01:51.7724998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.7725665Z 2025-05-07T20:01:51.7727137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7729438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7730475Z ^ 2025-05-07T20:01:51.7730786Z 2025-05-07T20:01:51.7732182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7734418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7735358Z ^ 2025-05-07T20:01:51.7735586Z 2025-05-07T20:01:51.7735950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.7736697Z 2025-05-07T20:01:51.7738411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7740798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7741699Z ^ 2025-05-07T20:01:51.7741990Z 2025-05-07T20:01:51.7743056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7745418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7746522Z ^ 2025-05-07T20:01:51.7746810Z 2025-05-07T20:01:51.7747266Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:51.7747859Z 2025-05-07T20:01:51.7749326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7751901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7753024Z ^ 2025-05-07T20:01:51.7753373Z 2025-05-07T20:01:51.7754729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7757008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7758087Z ^ 2025-05-07T20:01:51.7758316Z 2025-05-07T20:01:51.7758681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:51.7759271Z 2025-05-07T20:01:51.7762217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:51.7764676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:51.7765716Z ^ 2025-05-07T20:01:51.7766038Z 2025-05-07T20:01:59.9055736Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:59.9083668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9086291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9087781Z ^ 2025-05-07T20:01:59.9088035Z 2025-05-07T20:01:59.9088469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9089126Z 2025-05-07T20:01:59.9090755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9093341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9094486Z ^ 2025-05-07T20:01:59.9094852Z 2025-05-07T20:01:59.9096578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9099265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9100386Z ^ 2025-05-07T20:01:59.9100649Z 2025-05-07T20:01:59.9101082Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:59.9101722Z 2025-05-07T20:01:59.9103415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9105965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9107135Z ^ 2025-05-07T20:01:59.9107470Z 2025-05-07T20:01:59.9109081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9111667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9112750Z ^ 2025-05-07T20:01:59.9112995Z 2025-05-07T20:01:59.9113422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9114073Z 2025-05-07T20:01:59.9115663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9118282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9119440Z ^ 2025-05-07T20:01:59.9119776Z 2025-05-07T20:01:59.9121323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9123843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9124928Z ^ 2025-05-07T20:01:59.9125169Z 2025-05-07T20:01:59.9125580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9126227Z 2025-05-07T20:01:59.9127862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9130323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9131702Z ^ 2025-05-07T20:01:59.9132063Z 2025-05-07T20:01:59.9133402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9135549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9136726Z ^ 2025-05-07T20:01:59.9136956Z 2025-05-07T20:01:59.9137365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9137990Z 2025-05-07T20:01:59.9139382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9141771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9142810Z ^ 2025-05-07T20:01:59.9143156Z 2025-05-07T20:02:14.1234814Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:14.1256106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1258697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1259735Z ^ 2025-05-07T20:02:14.1259966Z 2025-05-07T20:02:14.1260368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.1261222Z 2025-05-07T20:02:14.1262638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1264981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1266194Z ^ 2025-05-07T20:02:14.1266538Z 2025-05-07T20:02:14.1267966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1270551Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1271527Z ^ 2025-05-07T20:02:14.1271786Z 2025-05-07T20:02:14.1272186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.1272787Z 2025-05-07T20:02:14.1274226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1276662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1277758Z ^ 2025-05-07T20:02:14.1278066Z 2025-05-07T20:02:14.1279538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1281895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1282854Z ^ 2025-05-07T20:02:14.1283080Z 2025-05-07T20:02:14.1283478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.1284099Z 2025-05-07T20:02:14.1285569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1287671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1288618Z ^ 2025-05-07T20:02:14.1288949Z 2025-05-07T20:02:14.1290255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1292615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1293566Z ^ 2025-05-07T20:02:14.1293811Z 2025-05-07T20:02:14.1294216Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.1295218Z 2025-05-07T20:02:14.1296857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1299271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1300348Z ^ 2025-05-07T20:02:14.1300722Z 2025-05-07T20:02:14.1302321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1304657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1305728Z ^ 2025-05-07T20:02:14.1305959Z 2025-05-07T20:02:14.1306448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.1307104Z 2025-05-07T20:02:14.1308679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.1311170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.1312259Z ^ 
2025-05-07T20:02:14.1312615Z 2025-05-07T20:02:14.9172515Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:14.9184661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9186108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9186714Z ^ 2025-05-07T20:02:14.9186853Z 2025-05-07T20:02:14.9187087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.9187427Z 2025-05-07T20:02:14.9188338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9189675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9190290Z ^ 2025-05-07T20:02:14.9190482Z 2025-05-07T20:02:14.9191318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9192636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9193237Z ^ 2025-05-07T20:02:14.9193380Z 2025-05-07T20:02:14.9193626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.9193962Z 2025-05-07T20:02:14.9194788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9196134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9196745Z ^ 2025-05-07T20:02:14.9196938Z 2025-05-07T20:02:14.9197753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9199087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9199682Z ^ 2025-05-07T20:02:14.9199831Z 2025-05-07T20:02:14.9200059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.9200397Z 2025-05-07T20:02:14.9201235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9202573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9203186Z ^ 
2025-05-07T20:02:14.9203375Z 2025-05-07T20:02:14.9204212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9205632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9206237Z ^ 2025-05-07T20:02:14.9206376Z 2025-05-07T20:02:14.9206608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.9206959Z 2025-05-07T20:02:14.9207826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9209171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9209772Z ^ 2025-05-07T20:02:14.9209973Z 2025-05-07T20:02:14.9210820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9212158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9212742Z ^ 2025-05-07T20:02:14.9212896Z 2025-05-07T20:02:14.9213126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.9213468Z 2025-05-07T20:02:14.9214325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:14.9215652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.9216265Z ^ 2025-05-07T20:02:14.9216584Z 2025-05-07T20:02:15.0386385Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:15.0410281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0413111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0414186Z ^ 2025-05-07T20:02:15.0414415Z 2025-05-07T20:02:15.0414826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:15.0415436Z 2025-05-07T20:02:15.0417189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0419790Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0420892Z ^ 2025-05-07T20:02:15.0421242Z 2025-05-07T20:02:15.0422930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0425599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0426774Z ^ 2025-05-07T20:02:15.0427029Z 2025-05-07T20:02:15.0427472Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:15.0428118Z 2025-05-07T20:02:15.0429659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0432092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0433238Z ^ 2025-05-07T20:02:15.0433614Z 2025-05-07T20:02:15.0435263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0437906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0439025Z ^ 2025-05-07T20:02:15.0439283Z 2025-05-07T20:02:15.0439693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:15.0440291Z 2025-05-07T20:02:15.0441831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0444304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0445617Z ^ 2025-05-07T20:02:15.0445925Z 2025-05-07T20:02:15.0447270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0449656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0450756Z ^ 2025-05-07T20:02:15.0450984Z 2025-05-07T20:02:15.0451391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:15.0452041Z 2025-05-07T20:02:15.0453680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0456598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0457770Z ^ 2025-05-07T20:02:15.0458139Z 2025-05-07T20:02:15.0459698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0462179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0463274Z ^ 2025-05-07T20:02:15.0463515Z 2025-05-07T20:02:15.0463940Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:02:15.0464598Z 2025-05-07T20:02:15.0466263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.0468926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:15.0470306Z ^ 2025-05-07T20:02:15.0470684Z 2025-05-07T20:02:16.7217542Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:16.7241381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7244071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7245296Z ^ 2025-05-07T20:02:16.7245579Z 2025-05-07T20:02:16.7245968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:16.7246627Z 2025-05-07T20:02:16.7248105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7250773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7251921Z ^ 2025-05-07T20:02:16.7252289Z 2025-05-07T20:02:16.7253897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7256611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7257770Z ^ 2025-05-07T20:02:16.7258012Z 2025-05-07T20:02:16.7258458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:16.7259098Z 2025-05-07T20:02:16.7260618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7263149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7264359Z ^ 2025-05-07T20:02:16.7264725Z 2025-05-07T20:02:16.7266313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7269001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7270419Z ^ 2025-05-07T20:02:16.7270714Z 2025-05-07T20:02:16.7271169Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:02:16.7274318Z 2025-05-07T20:02:16.7275960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7278489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7279629Z ^ 2025-05-07T20:02:16.7279997Z 2025-05-07T20:02:16.7281727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7284219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7285430Z ^ 2025-05-07T20:02:16.7285702Z 2025-05-07T20:02:16.7286294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:16.7286993Z 2025-05-07T20:02:16.7288613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7291217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7292360Z ^ 2025-05-07T20:02:16.7292752Z 2025-05-07T20:02:16.7294364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7297209Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7298311Z ^ 2025-05-07T20:02:16.7298586Z 2025-05-07T20:02:16.7299016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:16.7299669Z 2025-05-07T20:02:16.7301320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.7303990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:16.7305154Z ^ 2025-05-07T20:02:16.7305505Z 2025-05-07T20:02:17.5541326Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:17.5564044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5566592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5567724Z ^ 
2025-05-07T20:02:17.5567997Z 2025-05-07T20:02:17.5568422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:17.5569088Z 2025-05-07T20:02:17.5570825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5573277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5574325Z ^ 2025-05-07T20:02:17.5574700Z 2025-05-07T20:02:17.5576228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5578859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5579980Z ^ 2025-05-07T20:02:17.5580236Z 2025-05-07T20:02:17.5580698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:17.5581344Z 2025-05-07T20:02:17.5582988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5585694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5586882Z ^ 2025-05-07T20:02:17.5587222Z 2025-05-07T20:02:17.5588706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:17.5591508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5592640Z ^ 2025-05-07T20:02:17.5592889Z 2025-05-07T20:02:17.5593283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:17.5593975Z 2025-05-07T20:02:17.5595726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5598345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5599535Z ^ 2025-05-07T20:02:17.5599907Z 2025-05-07T20:02:17.5601690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5604071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5605157Z ^ 2025-05-07T20:02:17.5605397Z 2025-05-07T20:02:17.5605826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:17.5606455Z 2025-05-07T20:02:17.5608016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5610611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5611798Z ^ 2025-05-07T20:02:17.5612135Z 2025-05-07T20:02:17.5613633Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5616138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5617401Z ^ 2025-05-07T20:02:17.5617701Z 2025-05-07T20:02:17.5618161Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:17.5618768Z 2025-05-07T20:02:17.5620363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5622981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:17.5624060Z ^ 2025-05-07T20:02:17.5624431Z 2025-05-07T20:02:30.3914552Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:30.3933918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3936111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3937287Z ^ 2025-05-07T20:02:30.3937518Z 2025-05-07T20:02:30.3937898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3938421Z 2025-05-07T20:02:30.3939760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3941903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3942929Z ^ 2025-05-07T20:02:30.3956667Z 2025-05-07T20:02:30.3958060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3960132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3961082Z ^ 2025-05-07T20:02:30.3961283Z 2025-05-07T20:02:30.3961622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3962157Z 2025-05-07T20:02:30.3963538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3965922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3966941Z ^ 2025-05-07T20:02:30.3967255Z 2025-05-07T20:02:30.3968540Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3971206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3972121Z ^ 2025-05-07T20:02:30.3972314Z 2025-05-07T20:02:30.3972687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3973229Z 2025-05-07T20:02:30.3974714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3976985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3977931Z ^ 2025-05-07T20:02:30.3978213Z 2025-05-07T20:02:30.3979585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3981628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3982581Z ^ 2025-05-07T20:02:30.3982787Z 2025-05-07T20:02:30.3983179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3983727Z 2025-05-07T20:02:30.3985016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3987053Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3987977Z ^ 2025-05-07T20:02:30.3988281Z 2025-05-07T20:02:30.3989623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3991814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3992737Z ^ 2025-05-07T20:02:30.3992955Z 2025-05-07T20:02:30.3993328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.3993833Z 2025-05-07T20:02:30.3995147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.3997308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.3998230Z ^ 2025-05-07T20:02:30.3998535Z 2025-05-07T20:02:34.3102138Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 
2025-05-07T20:02:34.3122329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3124755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3125794Z ^ 2025-05-07T20:02:34.3126009Z 2025-05-07T20:02:34.3126444Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.3127061Z 2025-05-07T20:02:34.3128503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3130844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3131844Z ^ 2025-05-07T20:02:34.3132182Z 2025-05-07T20:02:34.3133546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3135751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3136902Z ^ 2025-05-07T20:02:34.3137124Z 2025-05-07T20:02:34.3137522Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.3138311Z 2025-05-07T20:02:34.3139717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3142017Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3143069Z ^ 2025-05-07T20:02:34.3143386Z 2025-05-07T20:02:34.3144956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3147229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3148239Z ^ 2025-05-07T20:02:34.3148574Z 2025-05-07T20:02:34.3148965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.3149473Z 2025-05-07T20:02:34.3150852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3153064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3154027Z ^ 2025-05-07T20:02:34.3154333Z 2025-05-07T20:02:34.3155704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3157981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3159033Z ^ 2025-05-07T20:02:34.3159263Z 2025-05-07T20:02:34.3159656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.3160218Z 2025-05-07T20:02:34.3161693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3163980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3164953Z ^ 2025-05-07T20:02:34.3165269Z 2025-05-07T20:02:34.3166594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3168835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3169806Z ^ 2025-05-07T20:02:34.3170043Z 2025-05-07T20:02:34.3170731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.3171312Z 2025-05-07T20:02:34.3172761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.3174963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.3176003Z ^ 2025-05-07T20:02:34.3176730Z 2025-05-07T20:02:37.5927060Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:37.5947275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5949698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5950718Z ^ 2025-05-07T20:02:37.5950943Z 2025-05-07T20:02:37.5951345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:37.5951973Z 2025-05-07T20:02:37.5953369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5955699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5956779Z ^ 2025-05-07T20:02:37.5957112Z 2025-05-07T20:02:37.5958512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5961117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5962073Z ^ 2025-05-07T20:02:37.5962305Z 2025-05-07T20:02:37.5962743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:37.5963320Z 2025-05-07T20:02:37.5964870Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5967219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5968242Z ^ 2025-05-07T20:02:37.5968576Z 2025-05-07T20:02:37.5970400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5972754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5973806Z ^ 2025-05-07T20:02:37.5974081Z 2025-05-07T20:02:37.5974491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:37.5975097Z 2025-05-07T20:02:37.5976719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5979052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5980180Z ^ 2025-05-07T20:02:37.5980522Z 2025-05-07T20:02:37.5981938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5984285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5985359Z ^ 
2025-05-07T20:02:37.5985608Z 2025-05-07T20:02:37.5986025Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:37.5986606Z 2025-05-07T20:02:37.5988042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5990423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5991397Z ^ 2025-05-07T20:02:37.5991724Z 2025-05-07T20:02:37.5993087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.5995383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.5996352Z ^ 2025-05-07T20:02:37.5996574Z 2025-05-07T20:02:37.5996983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:37.5997511Z 2025-05-07T20:02:37.5998910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.6001586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:37.6002591Z ^ 2025-05-07T20:02:37.6002893Z 2025-05-07T20:02:40.8710934Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:40.8732727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8735053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8736076Z ^ 2025-05-07T20:02:40.8736510Z 2025-05-07T20:02:40.8736926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.8737473Z 2025-05-07T20:02:40.8738860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8741530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8742553Z ^ 2025-05-07T20:02:40.8742884Z 2025-05-07T20:02:40.8744486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8746883Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8747899Z ^ 2025-05-07T20:02:40.8748152Z 2025-05-07T20:02:40.8748537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.8749104Z 2025-05-07T20:02:40.8750719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8753586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8754710Z ^ 2025-05-07T20:02:40.8755028Z 2025-05-07T20:02:40.8756415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8758735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8759752Z ^ 2025-05-07T20:02:40.8759976Z 2025-05-07T20:02:40.8760362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.8760930Z 2025-05-07T20:02:40.8762357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8764792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8765894Z ^ 2025-05-07T20:02:40.8766199Z 2025-05-07T20:02:40.8767566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8769902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8771260Z ^ 2025-05-07T20:02:40.8771544Z 2025-05-07T20:02:40.8771985Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.8772624Z 2025-05-07T20:02:40.8774272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8777051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8778216Z ^ 2025-05-07T20:02:40.8778571Z 2025-05-07T20:02:40.8780187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8782731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.8784039Z ^ 2025-05-07T20:02:40.8784261Z 2025-05-07T20:02:40.8784659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.8785303Z 2025-05-07T20:02:40.8786601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.8790962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:40.8792090Z ^ 2025-05-07T20:02:40.8792413Z 2025-05-07T20:02:43.9983516Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:02:43.9995223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:43.9996567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:43.9997173Z ^ 2025-05-07T20:02:43.9997313Z 2025-05-07T20:02:43.9997550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:43.9998050Z 2025-05-07T20:02:43.9998883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0000230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0000830Z ^ 2025-05-07T20:02:44.0001039Z 2025-05-07T20:02:44.0001925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0003271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0003866Z ^ 2025-05-07T20:02:44.0004005Z 2025-05-07T20:02:44.0004283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:44.0004624Z 2025-05-07T20:02:44.0005454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0006800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0007419Z ^ 2025-05-07T20:02:44.0007609Z 2025-05-07T20:02:44.0008438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0009780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0010377Z ^ 2025-05-07T20:02:44.0010510Z 2025-05-07T20:02:44.0010741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:44.0011091Z 2025-05-07T20:02:44.0011917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0013257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0013855Z ^ 
2025-05-07T20:02:44.0014043Z 2025-05-07T20:02:44.0014883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0016210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0016954Z ^ 2025-05-07T20:02:44.0017090Z 2025-05-07T20:02:44.0017340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:44.0017681Z 2025-05-07T20:02:44.0018511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0019862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0020473Z ^ 2025-05-07T20:02:44.0020751Z 2025-05-07T20:02:44.0021580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.0022912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0023495Z ^ 2025-05-07T20:02:44.0023646Z 2025-05-07T20:02:44.0023948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:44.0024286Z 2025-05-07T20:02:44.0025133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:44.0026464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:44.0027116Z ^ 2025-05-07T20:02:44.0027307Z 2025-05-07T20:02:45.9572642Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:02:45.9594190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9596839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9597849Z ^ 2025-05-07T20:02:45.9598082Z 2025-05-07T20:02:45.9598502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.9599095Z 2025-05-07T20:02:45.9600793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:45.9602959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9603994Z ^ 2025-05-07T20:02:45.9604336Z 2025-05-07T20:02:45.9605949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9608342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9609392Z ^ 2025-05-07T20:02:45.9609644Z 2025-05-07T20:02:45.9610050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.9610668Z 2025-05-07T20:02:45.9612189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9614635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9615792Z ^ 2025-05-07T20:02:45.9616148Z 2025-05-07T20:02:45.9617717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9620106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9621213Z ^ 2025-05-07T20:02:45.9621471Z 2025-05-07T20:02:45.9621902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.9622528Z 2025-05-07T20:02:45.9624044Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9626571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9627681Z ^ 2025-05-07T20:02:45.9628020Z 2025-05-07T20:02:45.9629502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9632318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9633475Z ^ 2025-05-07T20:02:45.9633691Z 2025-05-07T20:02:45.9634063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.9634638Z 2025-05-07T20:02:45.9636372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9638946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9640051Z ^ 2025-05-07T20:02:45.9640394Z 2025-05-07T20:02:45.9641977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9644087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9644996Z ^ 
2025-05-07T20:02:45.9645216Z 2025-05-07T20:02:45.9645609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.9646209Z 2025-05-07T20:02:45.9647680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.9650171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.9651394Z ^ 2025-05-07T20:02:45.9651708Z 2025-05-07T20:02:48.8858826Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:02:48.8881563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8884178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8885515Z ^ 2025-05-07T20:02:48.8885769Z 2025-05-07T20:02:48.8886204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.8886837Z 2025-05-07T20:02:48.8888448Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8892443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8893608Z ^ 2025-05-07T20:02:48.8893952Z 2025-05-07T20:02:48.8895526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8898239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8899369Z ^ 2025-05-07T20:02:48.8899617Z 2025-05-07T20:02:48.8900048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.8900704Z 2025-05-07T20:02:48.8902276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8904894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8906038Z ^ 2025-05-07T20:02:48.8906394Z 2025-05-07T20:02:48.8907970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8910550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8911668Z ^ 
2025-05-07T20:02:48.8911918Z 2025-05-07T20:02:48.8912354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.8913017Z 2025-05-07T20:02:48.8914444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8916932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8918066Z ^ 2025-05-07T20:02:48.8918435Z 2025-05-07T20:02:48.8920023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8922611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8924030Z ^ 2025-05-07T20:02:48.8924274Z 2025-05-07T20:02:48.8924724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.8925376Z 2025-05-07T20:02:48.8926979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8929605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8930743Z ^ 2025-05-07T20:02:48.8931103Z 2025-05-07T20:02:48.8932711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:48.8935287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8936551Z ^ 2025-05-07T20:02:48.8936797Z 2025-05-07T20:02:48.8937243Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.8937842Z 2025-05-07T20:02:48.8939315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.8941839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.8942972Z ^ 2025-05-07T20:02:48.8943318Z 2025-05-07T20:02:49.3044371Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:02:49.3066468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3068915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3069983Z ^ 
2025-05-07T20:02:49.3070484Z 2025-05-07T20:02:49.3070996Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.3071602Z 2025-05-07T20:02:49.3073078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3075531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3076611Z ^ 2025-05-07T20:02:49.3076971Z 2025-05-07T20:02:49.3078429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3080823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3081876Z ^ 2025-05-07T20:02:49.3082121Z 2025-05-07T20:02:49.3082519Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.3083120Z 2025-05-07T20:02:49.3084629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3087045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3088125Z ^ 2025-05-07T20:02:49.3088460Z 2025-05-07T20:02:49.3089954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:49.3092354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3093411Z ^ 2025-05-07T20:02:49.3093639Z 2025-05-07T20:02:49.3094047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.3094650Z 2025-05-07T20:02:49.3096137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3098619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3099675Z ^ 2025-05-07T20:02:49.3100026Z 2025-05-07T20:02:49.3101679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3104171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3105215Z ^ 2025-05-07T20:02:49.3105441Z 2025-05-07T20:02:49.3105854Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.3106443Z 2025-05-07T20:02:49.3108040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3110452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3111528Z ^ 2025-05-07T20:02:49.3111958Z 2025-05-07T20:02:49.3113428Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3115776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3116830Z ^ 2025-05-07T20:02:49.3117066Z 2025-05-07T20:02:49.3117458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.3118072Z 2025-05-07T20:02:49.3119543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.3122005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.3123035Z ^ 2025-05-07T20:02:49.3123303Z 2025-05-07T20:02:49.6245650Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:49.6276793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:02:49.6280152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6281637Z ^ 2025-05-07T20:02:49.6281936Z 2025-05-07T20:02:49.6282431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.6283262Z 2025-05-07T20:02:49.6285366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6288654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6290121Z ^ 2025-05-07T20:02:49.6290586Z 2025-05-07T20:02:49.6292662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6295890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6297523Z ^ 2025-05-07T20:02:49.6297818Z 2025-05-07T20:02:49.6298398Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.6299191Z 2025-05-07T20:02:49.6301205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6304718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6306344Z ^ 2025-05-07T20:02:49.6306825Z 2025-05-07T20:02:49.6308826Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6312135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6313574Z ^ 2025-05-07T20:02:49.6313912Z 2025-05-07T20:02:49.6314449Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.6315254Z 2025-05-07T20:02:49.6317295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6321053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6322604Z ^ 2025-05-07T20:02:49.6323070Z 2025-05-07T20:02:49.6325279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6328881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6330367Z ^ 2025-05-07T20:02:49.6330660Z 2025-05-07T20:02:49.6331183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.6331991Z 2025-05-07T20:02:49.6334130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6337546Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6338964Z ^ 2025-05-07T20:02:49.6339431Z 2025-05-07T20:02:49.6341377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6344623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6346077Z ^ 2025-05-07T20:02:49.6346419Z 2025-05-07T20:02:49.6346968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:49.6347756Z 2025-05-07T20:02:49.6349695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:49.6353201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:49.6354864Z ^ 2025-05-07T20:02:49.6355331Z 2025-05-07T20:02:52.4409296Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 
2025-05-07T20:02:52.4433270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4435951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4437158Z ^ 2025-05-07T20:02:52.4437475Z 2025-05-07T20:02:52.4437931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4438592Z 2025-05-07T20:02:52.4440177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4442800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4443888Z ^ 2025-05-07T20:02:52.4444221Z 2025-05-07T20:02:52.4445840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4448564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4449768Z ^ 2025-05-07T20:02:52.4450038Z 2025-05-07T20:02:52.4450487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4451162Z 2025-05-07T20:02:52.4452872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4455558Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4456908Z ^ 2025-05-07T20:02:52.4457277Z 2025-05-07T20:02:52.4458952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4461683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4463107Z ^ 2025-05-07T20:02:52.4463395Z 2025-05-07T20:02:52.4463854Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4464528Z 2025-05-07T20:02:52.4466227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4469005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4470476Z ^ 2025-05-07T20:02:52.4470849Z 2025-05-07T20:02:52.4472522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4475384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4476592Z ^ 2025-05-07T20:02:52.4476855Z 2025-05-07T20:02:52.4477304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4478005Z 2025-05-07T20:02:52.4479688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4482418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4483605Z ^ 2025-05-07T20:02:52.4484003Z 2025-05-07T20:02:52.4485660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4488341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4489378Z ^ 2025-05-07T20:02:52.4489610Z 2025-05-07T20:02:52.4490061Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4490682Z 2025-05-07T20:02:52.4492275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4494930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4496179Z ^ 2025-05-07T20:02:52.4496674Z 2025-05-07T20:02:52.9734965Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:02:52.9757862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9760468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9761576Z ^ 2025-05-07T20:02:52.9761807Z 2025-05-07T20:02:52.9762261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9762902Z 2025-05-07T20:02:52.9764479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9766974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9768080Z ^ 2025-05-07T20:02:52.9768395Z 2025-05-07T20:02:52.9769927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9772654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9773776Z ^ 2025-05-07T20:02:52.9774048Z 2025-05-07T20:02:52.9774500Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9775155Z 2025-05-07T20:02:52.9777037Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9779483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9780641Z ^ 2025-05-07T20:02:52.9780989Z 2025-05-07T20:02:52.9782603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9785631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9786698Z ^ 2025-05-07T20:02:52.9786928Z 2025-05-07T20:02:52.9787341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9787989Z 2025-05-07T20:02:52.9789802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9792462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9793591Z ^ 2025-05-07T20:02:52.9793965Z 2025-05-07T20:02:52.9795533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9797585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9798554Z ^ 
2025-05-07T20:02:52.9798803Z 2025-05-07T20:02:52.9799248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9799896Z 2025-05-07T20:02:52.9801400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9804019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9805168Z ^ 2025-05-07T20:02:52.9805521Z 2025-05-07T20:02:52.9807133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9809871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9810917Z ^ 2025-05-07T20:02:52.9811163Z 2025-05-07T20:02:52.9811593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.9812250Z 2025-05-07T20:02:52.9813872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.9816671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.9817813Z ^ 2025-05-07T20:02:52.9818202Z 2025-05-07T20:02:53.3409737Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:53.3431420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3433924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3435039Z ^ 2025-05-07T20:02:53.3435281Z 2025-05-07T20:02:53.3435746Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:53.3436350Z 2025-05-07T20:02:53.3437836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3440130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3441116Z ^ 2025-05-07T20:02:53.3441489Z 2025-05-07T20:02:53.3442956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3445374Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3446510Z ^ 2025-05-07T20:02:53.3446769Z 2025-05-07T20:02:53.3447205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:53.3447858Z 2025-05-07T20:02:53.3449480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3452352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3453458Z ^ 2025-05-07T20:02:53.3453795Z 2025-05-07T20:02:53.3455326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3458130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3459259Z ^ 2025-05-07T20:02:53.3459488Z 2025-05-07T20:02:53.3459939Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:53.3460558Z 2025-05-07T20:02:53.3462272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3464731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3465850Z ^ 2025-05-07T20:02:53.3466238Z 2025-05-07T20:02:53.3467779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3470599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3471727Z ^ 2025-05-07T20:02:53.3472001Z 2025-05-07T20:02:53.3472444Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:53.3473100Z 2025-05-07T20:02:53.3474715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3477222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3478322Z ^ 2025-05-07T20:02:53.3478664Z 2025-05-07T20:02:53.3480176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3482636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3483778Z ^ 2025-05-07T20:02:53.3484028Z 2025-05-07T20:02:53.3484454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:53.3485061Z 2025-05-07T20:02:53.3486632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:53.3489163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:53.3490268Z ^ 
2025-05-07T20:02:53.3490628Z 2025-05-07T20:02:57.9539953Z [386/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:02:57.9559184Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:02:58.3780754Z [387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:02:58.3799989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:02:58.5686423Z [388/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:02:58.5705458Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:10.3035796Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:10.3059644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3062324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3063466Z ^ 2025-05-07T20:03:10.3063738Z 2025-05-07T20:03:10.3064166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.3064768Z 2025-05-07T20:03:10.3066300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3068937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3070427Z ^ 2025-05-07T20:03:10.3070805Z 2025-05-07T20:03:10.3072437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3074906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3076019Z ^ 2025-05-07T20:03:10.3076244Z 2025-05-07T20:03:10.3076689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.3077385Z 2025-05-07T20:03:10.3079057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3081716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3082883Z ^ 2025-05-07T20:03:10.3083284Z 2025-05-07T20:03:10.3084950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3087861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3089041Z ^ 2025-05-07T20:03:10.3089330Z 2025-05-07T20:03:10.3089791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.3090470Z 2025-05-07T20:03:10.3092215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3094856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3096018Z ^ 2025-05-07T20:03:10.3096495Z 2025-05-07T20:03:10.3098217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3100913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3102105Z ^ 2025-05-07T20:03:10.3102370Z 2025-05-07T20:03:10.3102831Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:10.3103521Z 2025-05-07T20:03:10.3105192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3107858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3109027Z ^ 2025-05-07T20:03:10.3109391Z 2025-05-07T20:03:10.3111059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3113606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3114757Z ^ 2025-05-07T20:03:10.3115030Z 2025-05-07T20:03:10.3115458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.3116080Z 2025-05-07T20:03:10.3117593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.3120261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.3121445Z ^ 2025-05-07T20:03:10.3121804Z 2025-05-07T20:03:13.2255058Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:13.2279809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2282374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2283554Z ^ 2025-05-07T20:03:13.2283811Z 2025-05-07T20:03:13.2284283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.2284958Z 2025-05-07T20:03:13.2286634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2289343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2290563Z ^ 2025-05-07T20:03:13.2290946Z 2025-05-07T20:03:13.2292604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2295275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2296576Z ^ 
2025-05-07T20:03:13.2296874Z 2025-05-07T20:03:13.2297326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.2297999Z 2025-05-07T20:03:13.2299683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2302382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2303903Z ^ 2025-05-07T20:03:13.2304274Z 2025-05-07T20:03:13.2305952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2308727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2309943Z ^ 2025-05-07T20:03:13.2310196Z 2025-05-07T20:03:13.2310662Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.2311319Z 2025-05-07T20:03:13.2313092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2315818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2316995Z ^ 2025-05-07T20:03:13.2317387Z 2025-05-07T20:03:13.2319025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:13.2321681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2322840Z ^ 2025-05-07T20:03:13.2323123Z 2025-05-07T20:03:13.2323569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.2324244Z 2025-05-07T20:03:13.2325955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2328594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2329771Z ^ 2025-05-07T20:03:13.2330138Z 2025-05-07T20:03:13.2331805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2334446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2335621Z ^ 2025-05-07T20:03:13.2335873Z 2025-05-07T20:03:13.2336328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.2337124Z 2025-05-07T20:03:13.2338782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.2341184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.2342292Z ^ 2025-05-07T20:03:13.2342660Z 2025-05-07T20:03:13.6121522Z [391/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:13.6140478Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:14.1186194Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:14.1208163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1210723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1211852Z ^ 2025-05-07T20:03:14.1212116Z 2025-05-07T20:03:14.1212693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.1213352Z 2025-05-07T20:03:14.1214861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1217469Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1218518Z ^ 2025-05-07T20:03:14.1218870Z 2025-05-07T20:03:14.1220361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1222810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1223924Z ^ 2025-05-07T20:03:14.1224173Z 2025-05-07T20:03:14.1224595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.1225198Z 2025-05-07T20:03:14.1226814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1229300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1230439Z ^ 2025-05-07T20:03:14.1230801Z 2025-05-07T20:03:14.1232315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1234812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1235906Z ^ 2025-05-07T20:03:14.1236147Z 2025-05-07T20:03:14.1236557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.1237200Z 2025-05-07T20:03:14.1238694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1241145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1242263Z ^ 2025-05-07T20:03:14.1242637Z 2025-05-07T20:03:14.1244028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1246686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1247749Z ^ 2025-05-07T20:03:14.1247998Z 2025-05-07T20:03:14.1248436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.1249045Z 2025-05-07T20:03:14.1250778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1253221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1254340Z ^ 2025-05-07T20:03:14.1254776Z 2025-05-07T20:03:14.1256288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1258854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1259915Z ^ 2025-05-07T20:03:14.1260144Z 2025-05-07T20:03:14.1260563Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:14.1261102Z 2025-05-07T20:03:14.1262601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.1265078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.1266202Z ^ 2025-05-07T20:03:14.1266549Z 2025-05-07T20:03:15.5035867Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:15.5055306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5057640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5058570Z ^ 2025-05-07T20:03:15.5058776Z 2025-05-07T20:03:15.5059129Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5059682Z 2025-05-07T20:03:15.5061008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5063126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5064062Z ^ 2025-05-07T20:03:15.5064361Z 2025-05-07T20:03:15.5065664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5067766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5068698Z ^ 2025-05-07T20:03:15.5068901Z 2025-05-07T20:03:15.5069273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5069797Z 2025-05-07T20:03:15.5071342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5073447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5074404Z ^ 2025-05-07T20:03:15.5074695Z 2025-05-07T20:03:15.5075968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5078063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5078997Z ^ 2025-05-07T20:03:15.5079202Z 2025-05-07T20:03:15.5079555Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:15.5080070Z 2025-05-07T20:03:15.5081373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5083741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5084685Z ^ 2025-05-07T20:03:15.5084971Z 2025-05-07T20:03:15.5086286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5088519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5089498Z ^ 2025-05-07T20:03:15.5089709Z 2025-05-07T20:03:15.5090079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5090564Z 2025-05-07T20:03:15.5091984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5094304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5095361Z ^ 2025-05-07T20:03:15.5095722Z 2025-05-07T20:03:15.5097332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5099783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5100847Z ^ 2025-05-07T20:03:15.5101100Z 2025-05-07T20:03:15.5101521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5102149Z 2025-05-07T20:03:15.5103511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5106156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5107342Z ^ 2025-05-07T20:03:15.5107685Z 2025-05-07T20:03:16.4152932Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:16.4178542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4181264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4182382Z ^ 2025-05-07T20:03:16.4182627Z 2025-05-07T20:03:16.4183041Z Remark: The warnings can 
be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4183712Z 2025-05-07T20:03:16.4185380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4188116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4189291Z ^ 2025-05-07T20:03:16.4189653Z 2025-05-07T20:03:16.4191238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4193779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4194928Z ^ 2025-05-07T20:03:16.4195177Z 2025-05-07T20:03:16.4195631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4196300Z 2025-05-07T20:03:16.4197955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4200598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4201785Z ^ 2025-05-07T20:03:16.4202157Z 2025-05-07T20:03:16.4203778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4206517Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4207974Z ^ 2025-05-07T20:03:16.4208333Z 2025-05-07T20:03:16.4208795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4209474Z 2025-05-07T20:03:16.4211194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4213700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4214985Z ^ 2025-05-07T20:03:16.4215360Z 2025-05-07T20:03:16.4217173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4219889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4220880Z ^ 2025-05-07T20:03:16.4221097Z 2025-05-07T20:03:16.4221473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4222080Z 2025-05-07T20:03:16.4223689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4226373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4227543Z ^ 2025-05-07T20:03:16.4227919Z 2025-05-07T20:03:16.4229526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4232174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4233318Z ^ 2025-05-07T20:03:16.4233578Z 2025-05-07T20:03:16.4234003Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4234641Z 2025-05-07T20:03:16.4236287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4238907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4240086Z ^ 2025-05-07T20:03:16.4240441Z 2025-05-07T20:03:18.8312642Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:18.8336554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8339280Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8340502Z ^ 2025-05-07T20:03:18.8340758Z 2025-05-07T20:03:18.8341204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.8341865Z 2025-05-07T20:03:18.8343378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8345970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8347145Z ^ 2025-05-07T20:03:18.8347521Z 2025-05-07T20:03:18.8349137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8351782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8352934Z ^ 2025-05-07T20:03:18.8353199Z 2025-05-07T20:03:18.8353640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.8354282Z 2025-05-07T20:03:18.8355962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8358723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8359939Z ^ 2025-05-07T20:03:18.8360307Z 2025-05-07T20:03:18.8361997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8364719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8365899Z ^ 2025-05-07T20:03:18.8366151Z 2025-05-07T20:03:18.8366603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.8367292Z 2025-05-07T20:03:18.8369043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8371610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8372722Z ^ 2025-05-07T20:03:18.8373073Z 2025-05-07T20:03:18.8374905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8377641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8378781Z ^ 2025-05-07T20:03:18.8379030Z 2025-05-07T20:03:18.8379490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.8380149Z 2025-05-07T20:03:18.8381810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8384420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:18.8385597Z ^ 2025-05-07T20:03:18.8385950Z 2025-05-07T20:03:18.8387603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8390223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8391361Z ^ 2025-05-07T20:03:18.8391565Z 2025-05-07T20:03:18.8391973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.8392593Z 2025-05-07T20:03:18.8394246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.8396823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.8397987Z ^ 2025-05-07T20:03:18.8398349Z 2025-05-07T20:03:19.0523598Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:19.0546522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0549108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0550237Z ^ 2025-05-07T20:03:19.0550511Z 2025-05-07T20:03:19.0550955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:19.0551611Z 2025-05-07T20:03:19.0553260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0555765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0556884Z ^ 2025-05-07T20:03:19.0557232Z 2025-05-07T20:03:19.0558917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0561359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0562469Z ^ 2025-05-07T20:03:19.0562678Z 2025-05-07T20:03:19.0563075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:19.0563626Z 2025-05-07T20:03:19.0565108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0567735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:19.0568767Z ^ 2025-05-07T20:03:19.0569130Z 2025-05-07T20:03:19.0571182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0573871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0574959Z ^ 2025-05-07T20:03:19.0575198Z 2025-05-07T20:03:19.0575622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:19.0576241Z 2025-05-07T20:03:19.0578021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0580399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0581499Z ^ 2025-05-07T20:03:19.0581841Z 2025-05-07T20:03:19.0583380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0585680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0586804Z ^ 2025-05-07T20:03:19.0587047Z 2025-05-07T20:03:19.0587484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:19.0588161Z 2025-05-07T20:03:19.0589791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:19.0592355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0593442Z ^ 2025-05-07T20:03:19.0593804Z 2025-05-07T20:03:19.0595368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0598012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0599144Z ^ 2025-05-07T20:03:19.0599406Z 2025-05-07T20:03:19.0599849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:19.0600496Z 2025-05-07T20:03:19.0602055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:19.0604694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:19.0605867Z ^ 2025-05-07T20:03:19.0606219Z 2025-05-07T20:03:20.7793153Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:20.7816160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7818970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7820106Z ^ 2025-05-07T20:03:20.7820350Z 2025-05-07T20:03:20.7820748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:20.7821293Z 2025-05-07T20:03:20.7822839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7825346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7826419Z ^ 2025-05-07T20:03:20.7826787Z 2025-05-07T20:03:20.7828387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7830754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7832124Z ^ 2025-05-07T20:03:20.7832453Z 2025-05-07T20:03:20.7832896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:20.7833548Z 2025-05-07T20:03:20.7835195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7837692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7838860Z ^ 2025-05-07T20:03:20.7839163Z 2025-05-07T20:03:20.7840626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7843155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7844267Z ^ 2025-05-07T20:03:20.7844506Z 2025-05-07T20:03:20.7844932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:20.7845569Z 2025-05-07T20:03:20.7847195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7849833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7850974Z ^ 2025-05-07T20:03:20.7851347Z 2025-05-07T20:03:20.7852992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7855533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7856780Z ^ 2025-05-07T20:03:20.7856991Z 2025-05-07T20:03:20.7857345Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:20.7857946Z 2025-05-07T20:03:20.7859569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7862005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7863185Z ^ 2025-05-07T20:03:20.7863560Z 2025-05-07T20:03:20.7865114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7867714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7868847Z ^ 2025-05-07T20:03:20.7869104Z 2025-05-07T20:03:20.7869552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:20.7870514Z 2025-05-07T20:03:20.7872136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:20.7874723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:20.7876158Z ^ 2025-05-07T20:03:20.7876514Z 2025-05-07T20:03:24.0851753Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:24.0875485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0878140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0879186Z ^ 2025-05-07T20:03:24.0879418Z 2025-05-07T20:03:24.0879825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:24.0880447Z 2025-05-07T20:03:24.0881985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0884653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0885810Z ^ 2025-05-07T20:03:24.0886186Z 2025-05-07T20:03:24.0887820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0890825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0891959Z ^ 2025-05-07T20:03:24.0892209Z 
2025-05-07T20:03:24.0892670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:24.0893326Z 2025-05-07T20:03:24.0895057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0897839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0899010Z ^ 2025-05-07T20:03:24.0899379Z 2025-05-07T20:03:24.0901079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0903502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0904459Z ^ 2025-05-07T20:03:24.0904714Z 2025-05-07T20:03:24.0905137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:24.0905784Z 2025-05-07T20:03:24.0907440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0921705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0922910Z ^ 2025-05-07T20:03:24.0923303Z 2025-05-07T20:03:24.0924941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0927588Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0928743Z ^ 2025-05-07T20:03:24.0929012Z 2025-05-07T20:03:24.0929452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:24.0930109Z 2025-05-07T20:03:24.0931760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0934421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0935588Z ^ 2025-05-07T20:03:24.0935942Z 2025-05-07T20:03:24.0937691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0940081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0941047Z ^ 2025-05-07T20:03:24.0941265Z 2025-05-07T20:03:24.0941682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:24.0942500Z 2025-05-07T20:03:24.0944136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:24.0946616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:24.0947746Z ^ 2025-05-07T20:03:24.0948098Z 2025-05-07T20:03:25.8308473Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:25.8331122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8333639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8334783Z ^ 2025-05-07T20:03:25.8335033Z 2025-05-07T20:03:25.8335470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.8336136Z 2025-05-07T20:03:25.8337910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8340872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8342020Z ^ 2025-05-07T20:03:25.8342353Z 2025-05-07T20:03:25.8343953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8346623Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8347780Z ^ 2025-05-07T20:03:25.8348012Z 2025-05-07T20:03:25.8348453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.8349093Z 2025-05-07T20:03:25.8350779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8353296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8354431Z ^ 2025-05-07T20:03:25.8354763Z 2025-05-07T20:03:25.8356304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8358902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8360018Z ^ 2025-05-07T20:03:25.8360270Z 2025-05-07T20:03:25.8360690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.8361351Z 2025-05-07T20:03:25.8363024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8365694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8366828Z ^ 2025-05-07T20:03:25.8367179Z 2025-05-07T20:03:25.8368857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8371735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8372853Z ^ 2025-05-07T20:03:25.8373107Z 2025-05-07T20:03:25.8373563Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.8374214Z 2025-05-07T20:03:25.8375897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8378673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8379845Z ^ 2025-05-07T20:03:25.8380191Z 2025-05-07T20:03:25.8381714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8383991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8385410Z ^ 2025-05-07T20:03:25.8385658Z 2025-05-07T20:03:25.8386068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:25.8386692Z 2025-05-07T20:03:25.8388339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:25.8391148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:25.8392349Z ^ 
2025-05-07T20:03:25.8392719Z 2025-05-07T20:03:34.0659316Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:34.0682368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0684686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0685602Z ^ 2025-05-07T20:03:34.0685826Z 2025-05-07T20:03:34.0686182Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0687019Z 2025-05-07T20:03:34.0688342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0690377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0691276Z ^ 2025-05-07T20:03:34.0691554Z 2025-05-07T20:03:34.0692952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0694999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0695888Z ^ 2025-05-07T20:03:34.0696091Z 2025-05-07T20:03:34.0696716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0697254Z 2025-05-07T20:03:34.0698516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0700546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0701450Z ^ 2025-05-07T20:03:34.0701762Z 2025-05-07T20:03:34.0703100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0705301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0706201Z ^ 2025-05-07T20:03:34.0706419Z 2025-05-07T20:03:34.0706798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0707333Z 2025-05-07T20:03:34.0708790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0711065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0712050Z ^ 2025-05-07T20:03:34.0712357Z 2025-05-07T20:03:34.0713814Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0716110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0717213Z ^ 2025-05-07T20:03:34.0717453Z 2025-05-07T20:03:34.0717809Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0718377Z 2025-05-07T20:03:34.0719869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0722385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0723476Z ^ 2025-05-07T20:03:34.0724008Z 2025-05-07T20:03:34.0725415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0727918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0729155Z ^ 2025-05-07T20:03:34.0729423Z 2025-05-07T20:03:34.0729999Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0730674Z 2025-05-07T20:03:34.0732350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0735092Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0736524Z ^ 2025-05-07T20:03:34.0736900Z 2025-05-07T20:03:35.4681636Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:35.4699584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4702145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4703163Z ^ 2025-05-07T20:03:35.4703372Z 2025-05-07T20:03:35.4703743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:35.4704292Z 2025-05-07T20:03:35.4705846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4708099Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4709070Z ^ 2025-05-07T20:03:35.4709387Z 2025-05-07T20:03:35.4710912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4713166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4714210Z ^ 2025-05-07T20:03:35.4714421Z 2025-05-07T20:03:35.4714827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:35.4715363Z 2025-05-07T20:03:35.4716751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4719174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4720172Z ^ 2025-05-07T20:03:35.4720494Z 2025-05-07T20:03:35.4721976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4724389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4725459Z ^ 2025-05-07T20:03:35.4725701Z 2025-05-07T20:03:35.4726100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:35.4726666Z 2025-05-07T20:03:35.4728034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4730481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4731721Z ^ 2025-05-07T20:03:35.4732087Z 2025-05-07T20:03:35.4733757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4736614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4737794Z ^ 2025-05-07T20:03:35.4738046Z 2025-05-07T20:03:35.4738493Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:35.4739184Z 2025-05-07T20:03:35.4740860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4743796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4744977Z ^ 2025-05-07T20:03:35.4745360Z 2025-05-07T20:03:35.4747022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4749810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4750992Z ^ 2025-05-07T20:03:35.4751263Z 2025-05-07T20:03:35.4751716Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:35.4752391Z 2025-05-07T20:03:35.4754153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:35.4756778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:35.4757979Z ^ 2025-05-07T20:03:35.4758346Z 2025-05-07T20:03:36.4521194Z [402/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:03:36.4536746Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:37.9307857Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:37.9328132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9330535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9331566Z ^ 2025-05-07T20:03:37.9331800Z 2025-05-07T20:03:37.9332231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9332826Z 2025-05-07T20:03:37.9334186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9336551Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9337534Z ^ 2025-05-07T20:03:37.9337840Z 2025-05-07T20:03:37.9339154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9341342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9342319Z ^ 2025-05-07T20:03:37.9342880Z 2025-05-07T20:03:37.9343280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9343752Z 2025-05-07T20:03:37.9344873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9346836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9347982Z ^ 2025-05-07T20:03:37.9348330Z 2025-05-07T20:03:37.9349810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9352192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9353280Z ^ 2025-05-07T20:03:37.9353482Z 2025-05-07T20:03:37.9353793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9354392Z 2025-05-07T20:03:37.9355928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9358119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9359040Z ^ 2025-05-07T20:03:37.9359354Z 2025-05-07T20:03:37.9360707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9362946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9363989Z ^ 2025-05-07T20:03:37.9364218Z 2025-05-07T20:03:37.9364626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9365201Z 2025-05-07T20:03:37.9366711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9369131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9370466Z ^ 2025-05-07T20:03:37.9370793Z 2025-05-07T20:03:37.9372287Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9374542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9375570Z ^ 2025-05-07T20:03:37.9375812Z 2025-05-07T20:03:37.9376235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:37.9376986Z 2025-05-07T20:03:37.9378433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:37.9380879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:37.9382368Z ^ 2025-05-07T20:03:37.9382725Z 2025-05-07T20:03:41.5133739Z [404/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:03:41.5152352Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:42.4054127Z [405/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:03:42.4071960Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:43.9750378Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:43.9772585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9775336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9776579Z ^ 2025-05-07T20:03:43.9776832Z 2025-05-07T20:03:43.9777268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.9777929Z 2025-05-07T20:03:43.9779889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9782393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9783405Z ^ 2025-05-07T20:03:43.9783764Z 2025-05-07T20:03:43.9785495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9788124Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9789255Z ^ 2025-05-07T20:03:43.9789499Z 2025-05-07T20:03:43.9789964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.9790711Z 2025-05-07T20:03:43.9792258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9794846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9795952Z ^ 2025-05-07T20:03:43.9796298Z 2025-05-07T20:03:43.9797730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9800213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9801369Z ^ 2025-05-07T20:03:43.9801620Z 2025-05-07T20:03:43.9802034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.9802685Z 2025-05-07T20:03:43.9804361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9806969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9808132Z ^ 2025-05-07T20:03:43.9808493Z 2025-05-07T20:03:43.9810131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9812705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9813842Z ^ 2025-05-07T20:03:43.9814084Z 2025-05-07T20:03:43.9814539Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.9815185Z 2025-05-07T20:03:43.9817001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9819658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9820846Z ^ 2025-05-07T20:03:43.9821213Z 2025-05-07T20:03:43.9822849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9825742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9826851Z ^ 2025-05-07T20:03:43.9827107Z 2025-05-07T20:03:43.9827542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.9828111Z 2025-05-07T20:03:43.9829797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.9832267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.9833347Z ^ 
2025-05-07T20:03:43.9833638Z 2025-05-07T20:03:44.3760087Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:44.3782623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3785110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3786600Z ^ 2025-05-07T20:03:44.3786831Z 2025-05-07T20:03:44.3787266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:44.3787909Z 2025-05-07T20:03:44.3789474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3792123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3793223Z ^ 2025-05-07T20:03:44.3793558Z 2025-05-07T20:03:44.3795072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3797707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3798817Z ^ 2025-05-07T20:03:44.3799074Z 2025-05-07T20:03:44.3799485Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:44.3800106Z 2025-05-07T20:03:44.3801724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3804213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3805365Z ^ 2025-05-07T20:03:44.3805696Z 2025-05-07T20:03:44.3807282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3809737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3810836Z ^ 2025-05-07T20:03:44.3811076Z 2025-05-07T20:03:44.3811491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:44.3812113Z 2025-05-07T20:03:44.3813588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3816024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3817268Z ^ 2025-05-07T20:03:44.3817603Z 2025-05-07T20:03:44.3819182Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3821617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3822685Z ^ 2025-05-07T20:03:44.3822927Z 2025-05-07T20:03:44.3823346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:44.3823947Z 2025-05-07T20:03:44.3825510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3828140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3828948Z ^ 2025-05-07T20:03:44.3829197Z 2025-05-07T20:03:44.3830500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3832910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3833891Z ^ 2025-05-07T20:03:44.3834096Z 2025-05-07T20:03:44.3834497Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:44.3835155Z 2025-05-07T20:03:44.3836756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:44.3839438Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:44.3840513Z ^ 2025-05-07T20:03:44.3840835Z 2025-05-07T20:03:44.9501872Z [408/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:03:44.9521880Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:46.7773970Z [409/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:03:46.7792810Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:47.3057967Z [410/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:03:47.3077984Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:48.3438418Z [411/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:03:48.3458125Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:48.6115990Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:48.6140017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6142872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6144003Z ^ 2025-05-07T20:03:48.6144258Z 2025-05-07T20:03:48.6144697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.6145341Z 2025-05-07T20:03:48.6146931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6149561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6150724Z ^ 2025-05-07T20:03:48.6151091Z 2025-05-07T20:03:48.6152701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6155339Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6156400Z ^ 2025-05-07T20:03:48.6156656Z 2025-05-07T20:03:48.6157122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.6157773Z 2025-05-07T20:03:48.6159440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6162002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6163122Z ^ 2025-05-07T20:03:48.6163464Z 2025-05-07T20:03:48.6164980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6167600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6169067Z ^ 2025-05-07T20:03:48.6169325Z 2025-05-07T20:03:48.6169761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.6170710Z 2025-05-07T20:03:48.6172394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6175333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6176632Z ^ 2025-05-07T20:03:48.6176999Z 2025-05-07T20:03:48.6178633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6181406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6182627Z ^ 2025-05-07T20:03:48.6182886Z 2025-05-07T20:03:48.6183384Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.6184036Z 2025-05-07T20:03:48.6185727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6188336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6189549Z ^ 2025-05-07T20:03:48.6189933Z 2025-05-07T20:03:48.6191455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6194130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6195284Z ^ 2025-05-07T20:03:48.6195571Z 2025-05-07T20:03:48.6196031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.6196722Z 2025-05-07T20:03:48.6198421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.6201071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.6202318Z ^ 
2025-05-07T20:03:48.6202688Z 2025-05-07T20:03:49.2652097Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:49.2675347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2677663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2678570Z ^ 2025-05-07T20:03:49.2678820Z 2025-05-07T20:03:49.2679210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.2679845Z 2025-05-07T20:03:49.2681367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2683795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2684994Z ^ 2025-05-07T20:03:49.2685351Z 2025-05-07T20:03:49.2686978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2689483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2690616Z ^ 2025-05-07T20:03:49.2690878Z 2025-05-07T20:03:49.2691346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.2691997Z 2025-05-07T20:03:49.2693553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2696203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2697504Z ^ 2025-05-07T20:03:49.2698295Z 2025-05-07T20:03:49.2699911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2702522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2703627Z ^ 2025-05-07T20:03:49.2703900Z 2025-05-07T20:03:49.2704436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.2705104Z 2025-05-07T20:03:49.2706704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2709299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2710468Z ^ 
2025-05-07T20:03:49.2710825Z 2025-05-07T20:03:49.2712406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2714879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2716004Z ^ 2025-05-07T20:03:49.2716261Z 2025-05-07T20:03:49.2716704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.2717375Z 2025-05-07T20:03:49.2718991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2721576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2722627Z ^ 2025-05-07T20:03:49.2723015Z 2025-05-07T20:03:49.2724597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.2727062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2728166Z ^ 2025-05-07T20:03:49.2728459Z 2025-05-07T20:03:49.2728894Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.2729536Z 2025-05-07T20:03:49.2731084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:49.2733548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.2734692Z ^ 2025-05-07T20:03:49.2735050Z 2025-05-07T20:03:50.3809387Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:50.3832387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3834803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3835897Z ^ 2025-05-07T20:03:50.3836175Z 2025-05-07T20:03:50.3836619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.3837237Z 2025-05-07T20:03:50.3838765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:50.3841442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3842594Z ^ 2025-05-07T20:03:50.3842964Z 2025-05-07T20:03:50.3844571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3847124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3848170Z ^ 2025-05-07T20:03:50.3848426Z 2025-05-07T20:03:50.3848807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.3849462Z 2025-05-07T20:03:50.3851034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3854017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3855223Z ^ 2025-05-07T20:03:50.3855579Z 2025-05-07T20:03:50.3857503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3860018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3861102Z ^ 2025-05-07T20:03:50.3861321Z 2025-05-07T20:03:50.3861719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.3862333Z 2025-05-07T20:03:50.3864007Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3866640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3867823Z ^ 2025-05-07T20:03:50.3868203Z 2025-05-07T20:03:50.3869831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3872684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3873792Z ^ 2025-05-07T20:03:50.3874023Z 2025-05-07T20:03:50.3874460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.3875078Z 2025-05-07T20:03:50.3876549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3879050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3880255Z ^ 2025-05-07T20:03:50.3880629Z 2025-05-07T20:03:50.3882295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3884956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3886000Z ^ 
2025-05-07T20:03:50.3886252Z 2025-05-07T20:03:50.3886672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.3887279Z 2025-05-07T20:03:50.3888815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.3891058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.3892099Z ^ 2025-05-07T20:03:50.3892399Z 2025-05-07T20:03:50.8668924Z [415/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:03:50.8688747Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:51.0997020Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:51.1020343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1023130Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1024269Z ^ 2025-05-07T20:03:51.1024525Z 2025-05-07T20:03:51.1024962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.1025612Z 2025-05-07T20:03:51.1027262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1029885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1031058Z ^ 2025-05-07T20:03:51.1031416Z 2025-05-07T20:03:51.1033033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1035654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1036798Z ^ 2025-05-07T20:03:51.1037047Z 2025-05-07T20:03:51.1037480Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.1038143Z 2025-05-07T20:03:51.1039830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1042465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1043622Z ^ 2025-05-07T20:03:51.1043995Z 2025-05-07T20:03:51.1045625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1048242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1049373Z ^ 2025-05-07T20:03:51.1049635Z 2025-05-07T20:03:51.1050070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.1050721Z 2025-05-07T20:03:51.1052365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1054990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1058009Z ^ 2025-05-07T20:03:51.1058365Z 2025-05-07T20:03:51.1059979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1062634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1063771Z ^ 2025-05-07T20:03:51.1064018Z 2025-05-07T20:03:51.1064451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.1065115Z 2025-05-07T20:03:51.1066805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1069451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:03:51.1070878Z ^ 2025-05-07T20:03:51.1071243Z 2025-05-07T20:03:51.1072850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1075444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1076577Z ^ 2025-05-07T20:03:51.1076816Z 2025-05-07T20:03:51.1077269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.1077918Z 2025-05-07T20:03:51.1079559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.1082195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.1083363Z ^ 2025-05-07T20:03:51.1083723Z 2025-05-07T20:03:52.1929840Z [417/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:52.1954360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1957028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1958199Z ^ 2025-05-07T20:03:52.1958441Z 2025-05-07T20:03:52.1958862Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1959450Z 2025-05-07T20:03:52.1961080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1963727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1964880Z ^ 2025-05-07T20:03:52.1965245Z 2025-05-07T20:03:52.1966680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1969260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1970630Z ^ 2025-05-07T20:03:52.1970873Z 2025-05-07T20:03:52.1971321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1971957Z 2025-05-07T20:03:52.1973550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1976028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1977295Z ^ 
2025-05-07T20:03:52.1977640Z 2025-05-07T20:03:52.1979274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1981957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1983121Z ^ 2025-05-07T20:03:52.1983361Z 2025-05-07T20:03:52.1983811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1984840Z 2025-05-07T20:03:52.1986440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1989047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1990207Z ^ 2025-05-07T20:03:52.1990568Z 2025-05-07T20:03:52.1992330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1994967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1996112Z ^ 2025-05-07T20:03:52.1996372Z 2025-05-07T20:03:52.1996932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1997601Z 2025-05-07T20:03:52.1999280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:52.2001965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.2003066Z ^ 2025-05-07T20:03:52.2003429Z 2025-05-07T20:03:52.2005111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.2007788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.2008937Z ^ 2025-05-07T20:03:52.2009209Z 2025-05-07T20:03:52.2009656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.2010315Z 2025-05-07T20:03:52.2011983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.2014529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.2015711Z ^ 2025-05-07T20:03:52.2016072Z 2025-05-07T20:03:52.4126382Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 
2025-05-07T20:03:52.4149735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4152328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4153488Z ^ 2025-05-07T20:03:52.4153742Z 2025-05-07T20:03:52.4154185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.4154852Z 2025-05-07T20:03:52.4156567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4159257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4160434Z ^ 2025-05-07T20:03:52.4160788Z 2025-05-07T20:03:52.4162410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4165015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4166049Z ^ 2025-05-07T20:03:52.4166308Z 2025-05-07T20:03:52.4166760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.4167393Z 2025-05-07T20:03:52.4169026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4172175Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4173247Z ^ 2025-05-07T20:03:52.4173569Z 2025-05-07T20:03:52.4175101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4178026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4179310Z ^ 2025-05-07T20:03:52.4179563Z 2025-05-07T20:03:52.4180018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.4180685Z 2025-05-07T20:03:52.4182225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4184871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4186023Z ^ 2025-05-07T20:03:52.4186389Z 2025-05-07T20:03:52.4188142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4190813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4191970Z ^ 2025-05-07T20:03:52.4192224Z 2025-05-07T20:03:52.4192669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.4193350Z 2025-05-07T20:03:52.4195019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4197687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4198855Z ^ 2025-05-07T20:03:52.4199227Z 2025-05-07T20:03:52.4200861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4203488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4204644Z ^ 2025-05-07T20:03:52.4204875Z 2025-05-07T20:03:52.4205332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.4205929Z 2025-05-07T20:03:52.4207562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.4209727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.4210907Z ^ 2025-05-07T20:03:52.4211236Z 2025-05-07T20:03:53.7037981Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:53.7060957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7063572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7064700Z ^ 2025-05-07T20:03:53.7064940Z 2025-05-07T20:03:53.7065373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7065999Z 2025-05-07T20:03:53.7067617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7070452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7071559Z ^ 2025-05-07T20:03:53.7071916Z 2025-05-07T20:03:53.7073478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7075986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7077077Z ^ 2025-05-07T20:03:53.7077337Z 2025-05-07T20:03:53.7077772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7078395Z 2025-05-07T20:03:53.7079988Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7082518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7083975Z ^ 2025-05-07T20:03:53.7084327Z 2025-05-07T20:03:53.7085881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7088444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7089563Z ^ 2025-05-07T20:03:53.7089802Z 2025-05-07T20:03:53.7090372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7091046Z 2025-05-07T20:03:53.7092631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7095351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7096684Z ^ 2025-05-07T20:03:53.7097013Z 2025-05-07T20:03:53.7098549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7101124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7102204Z ^ 
2025-05-07T20:03:53.7102452Z 2025-05-07T20:03:53.7102889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7103491Z 2025-05-07T20:03:53.7105013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7107521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7108618Z ^ 2025-05-07T20:03:53.7108944Z 2025-05-07T20:03:53.7110503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7112982Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7114107Z ^ 2025-05-07T20:03:53.7114371Z 2025-05-07T20:03:53.7114790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7115436Z 2025-05-07T20:03:53.7116925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7118987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7120018Z ^ 2025-05-07T20:03:53.7120353Z 2025-05-07T20:03:54.3721475Z [420/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:03:54.3740787Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:55.2674804Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:03:55.2694251Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.4982541Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:03:56.5002560Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.5717236Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:03:56.5737482Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.9620040Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:03:56.9640331Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.1915312Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:03:57.1935578Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.2089887Z [426/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:57.2110468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2112997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2114116Z ^ 2025-05-07T20:03:57.2114348Z 
2025-05-07T20:03:57.2114873Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2115500Z 2025-05-07T20:03:57.2116966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2119503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2120683Z ^ 2025-05-07T20:03:57.2121014Z 2025-05-07T20:03:57.2122440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2124835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2125941Z ^ 2025-05-07T20:03:57.2126178Z 2025-05-07T20:03:57.2126614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2127220Z 2025-05-07T20:03:57.2128840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2131496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2132673Z ^ 2025-05-07T20:03:57.2133034Z 2025-05-07T20:03:57.2134673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2137422Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2138546Z ^ 2025-05-07T20:03:57.2138803Z 2025-05-07T20:03:57.2139234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2139888Z 2025-05-07T20:03:57.2141566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2143927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2145083Z ^ 2025-05-07T20:03:57.2145651Z 2025-05-07T20:03:57.2147378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2150008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2151198Z ^ 2025-05-07T20:03:57.2151447Z 2025-05-07T20:03:57.2151887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2152628Z 2025-05-07T20:03:57.2154298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2156924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2158141Z ^ 2025-05-07T20:03:57.2158515Z 2025-05-07T20:03:57.2160125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2162797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2163970Z ^ 2025-05-07T20:03:57.2164239Z 2025-05-07T20:03:57.2164678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.2165338Z 2025-05-07T20:03:57.2167000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.2169617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.2171032Z ^ 2025-05-07T20:03:57.2171381Z 2025-05-07T20:03:57.5034837Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:03:57.5055224Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.6964745Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:03:57.6983901Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:57.7287274Z [429/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:03:57.7306621Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:58.0732442Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:03:58.0752219Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:58.3204985Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:03:58.3215364Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:59.1733272Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:03:59.1752901Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:59.9746920Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:03:59.9765455Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:00.2553808Z [434/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:00.2577745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2580214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2581306Z ^ 2025-05-07T20:04:00.2581515Z 2025-05-07T20:04:00.2581930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.2582582Z 2025-05-07T20:04:00.2584101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2586584Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2587661Z ^ 2025-05-07T20:04:00.2588028Z 2025-05-07T20:04:00.2589535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2592063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2593112Z ^ 2025-05-07T20:04:00.2593378Z 2025-05-07T20:04:00.2593788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.2594403Z 2025-05-07T20:04:00.2595965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2598376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2599501Z ^ 2025-05-07T20:04:00.2599837Z 2025-05-07T20:04:00.2601386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2604195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2605255Z ^ 2025-05-07T20:04:00.2605488Z 2025-05-07T20:04:00.2605891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.2606482Z 2025-05-07T20:04:00.2608190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2610682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2611769Z ^ 2025-05-07T20:04:00.2612106Z 2025-05-07T20:04:00.2613746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2616252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2617599Z ^ 2025-05-07T20:04:00.2617851Z 2025-05-07T20:04:00.2618195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.2618811Z 2025-05-07T20:04:00.2620375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2622863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2623971Z ^ 2025-05-07T20:04:00.2624309Z 2025-05-07T20:04:00.2625800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2628274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2629358Z ^ 2025-05-07T20:04:00.2629584Z 2025-05-07T20:04:00.2630003Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:00.2630628Z 2025-05-07T20:04:00.2632136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.2634614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.2635716Z ^ 2025-05-07T20:04:00.2636076Z 2025-05-07T20:04:00.5080079Z [435/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:00.5101823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5104375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5105465Z ^ 2025-05-07T20:04:00.5105689Z 2025-05-07T20:04:00.5106078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.5106666Z 2025-05-07T20:04:00.5108154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:04:00.5110579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5111621Z ^ 2025-05-07T20:04:00.5111957Z 2025-05-07T20:04:00.5113549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5116039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5117051Z ^ 2025-05-07T20:04:00.5117291Z 2025-05-07T20:04:00.5117667Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.5118208Z 2025-05-07T20:04:00.5119719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5122091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5123489Z ^ 2025-05-07T20:04:00.5123842Z 2025-05-07T20:04:00.5125298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5127486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5128546Z ^ 2025-05-07T20:04:00.5128777Z 2025-05-07T20:04:00.5129307Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.5129948Z 2025-05-07T20:04:00.5131506Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5133918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5134947Z ^ 2025-05-07T20:04:00.5135282Z 2025-05-07T20:04:00.5136847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5139235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5140285Z ^ 2025-05-07T20:04:00.5140547Z 2025-05-07T20:04:00.5140976Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.5141609Z 2025-05-07T20:04:00.5143205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5145583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5146694Z ^ 2025-05-07T20:04:00.5147042Z 2025-05-07T20:04:00.5148462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5150678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5151725Z ^ 
2025-05-07T20:04:00.5151960Z 2025-05-07T20:04:00.5152378Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.5153017Z 2025-05-07T20:04:00.5154562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.5157031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.5158102Z ^ 2025-05-07T20:04:00.5158459Z 2025-05-07T20:04:00.6636532Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:00.6656483Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:01.3185154Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:01.3207615Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:01.5855860Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:01.5876250Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:04:02.7103004Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:02.7122470Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:03.3652130Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:03.3672488Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:04.1521869Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:04.1542028Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:04.1588823Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:04.1607520Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:04.5864491Z [443/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:04.5884944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5887401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5888462Z ^ 2025-05-07T20:04:04.5888745Z 2025-05-07T20:04:04.5889144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.5889747Z 2025-05-07T20:04:04.5891284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5893729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5894827Z ^ 2025-05-07T20:04:04.5895161Z 2025-05-07T20:04:04.5896772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5899564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:04.5900621Z ^ 2025-05-07T20:04:04.5900849Z 2025-05-07T20:04:04.5901252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.5901907Z 2025-05-07T20:04:04.5903522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5905944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5906957Z ^ 2025-05-07T20:04:04.5907324Z 2025-05-07T20:04:04.5908995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5911386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5912378Z ^ 2025-05-07T20:04:04.5912562Z 2025-05-07T20:04:04.5912907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.5913375Z 2025-05-07T20:04:04.5914695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5917106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5918173Z ^ 2025-05-07T20:04:04.5918511Z 2025-05-07T20:04:04.5920024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:04:04.5922490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5923570Z ^ 2025-05-07T20:04:04.5923828Z 2025-05-07T20:04:04.5924264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.5924895Z 2025-05-07T20:04:04.5926278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5928632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5929706Z ^ 2025-05-07T20:04:04.5930041Z 2025-05-07T20:04:04.5931551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5933866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5934851Z ^ 2025-05-07T20:04:04.5935086Z 2025-05-07T20:04:04.5935497Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.5936064Z 2025-05-07T20:04:04.5937637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.5940187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.5941280Z ^ 2025-05-07T20:04:04.5941633Z 2025-05-07T20:04:04.8627724Z [444/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:04.8645720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.4769751Z [445/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:05.4790795Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.5594776Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:05.5612650Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.0394301Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:06.0422290Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:04:06.2320149Z [448/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:06.2332253Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.3028501Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:06.3046913Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.3758484Z [450/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:06.3780969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3783380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3784507Z ^ 2025-05-07T20:04:06.3784775Z 2025-05-07T20:04:06.3785249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3785869Z 2025-05-07T20:04:06.3787446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3789955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3791061Z ^ 2025-05-07T20:04:06.3791385Z 2025-05-07T20:04:06.3792888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3795443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3796513Z ^ 2025-05-07T20:04:06.3796772Z 2025-05-07T20:04:06.3797200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3797806Z 2025-05-07T20:04:06.3799349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3801704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3802847Z ^ 
2025-05-07T20:04:06.3803194Z 2025-05-07T20:04:06.3804831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3807930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3809070Z ^ 2025-05-07T20:04:06.3809316Z 2025-05-07T20:04:06.3809725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3810266Z 2025-05-07T20:04:06.3811851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3814290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3815357Z ^ 2025-05-07T20:04:06.3815721Z 2025-05-07T20:04:06.3817565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3820003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3821075Z ^ 2025-05-07T20:04:06.3821336Z 2025-05-07T20:04:06.3821757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3822378Z 2025-05-07T20:04:06.3823973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:06.3826455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3827519Z ^ 2025-05-07T20:04:06.3827860Z 2025-05-07T20:04:06.3829385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3831884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3832937Z ^ 2025-05-07T20:04:06.3833167Z 2025-05-07T20:04:06.3833607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3834218Z 2025-05-07T20:04:06.3835747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3838118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3839231Z ^ 2025-05-07T20:04:06.3839602Z 2025-05-07T20:04:06.4434919Z [451/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:06.4454155Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.7168134Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:06.7182342Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.7769849Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:06.7788482Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.5139879Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:09.5163688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:09.5166282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5167449Z ^ 2025-05-07T20:04:09.5167707Z 2025-05-07T20:04:09.5168154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5168836Z 2025-05-07T20:04:09.5170822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5173523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5174647Z ^ 2025-05-07T20:04:09.5174984Z 2025-05-07T20:04:09.5176583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5179245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5180355Z ^ 2025-05-07T20:04:09.5180598Z 2025-05-07T20:04:09.5181039Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5181709Z 2025-05-07T20:04:09.5183367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5186063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5187276Z ^ 2025-05-07T20:04:09.5187649Z 2025-05-07T20:04:09.5189341Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5192062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5193246Z ^ 2025-05-07T20:04:09.5193798Z 2025-05-07T20:04:09.5197642Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5198353Z 2025-05-07T20:04:09.5199944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5202221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5203529Z ^ 2025-05-07T20:04:09.5203898Z 2025-05-07T20:04:09.5205549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5208173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5209484Z ^ 2025-05-07T20:04:09.5209751Z 2025-05-07T20:04:09.5210226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5210891Z 2025-05-07T20:04:09.5212533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5215105Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5216451Z ^ 2025-05-07T20:04:09.5216824Z 2025-05-07T20:04:09.5218463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5221040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5222119Z ^ 2025-05-07T20:04:09.5222395Z 2025-05-07T20:04:09.5222821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5223493Z 2025-05-07T20:04:09.5225139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5227784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5228934Z ^ 2025-05-07T20:04:09.5229309Z 2025-05-07T20:04:09.5338591Z [455/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:09.5359572Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5361939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5363051Z ^ 2025-05-07T20:04:09.5363309Z 2025-05-07T20:04:09.5363747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5364370Z 2025-05-07T20:04:09.5366002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5368609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5369780Z ^ 2025-05-07T20:04:09.5370464Z 2025-05-07T20:04:09.5372037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5374611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5375769Z ^ 2025-05-07T20:04:09.5375990Z 2025-05-07T20:04:09.5376521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5377135Z 2025-05-07T20:04:09.5378762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5381455Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5382606Z ^ 2025-05-07T20:04:09.5382965Z 2025-05-07T20:04:09.5384616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5387663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5388828Z ^ 2025-05-07T20:04:09.5389089Z 2025-05-07T20:04:09.5389511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5390151Z 2025-05-07T20:04:09.5391953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5394672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5395876Z ^ 2025-05-07T20:04:09.5396244Z 2025-05-07T20:04:09.5398004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5400570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5401732Z ^ 2025-05-07T20:04:09.5401985Z 2025-05-07T20:04:09.5402426Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5403089Z 2025-05-07T20:04:09.5404735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5407327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5408487Z ^ 2025-05-07T20:04:09.5408868Z 2025-05-07T20:04:09.5410518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5413143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5414304Z ^ 2025-05-07T20:04:09.5414571Z 2025-05-07T20:04:09.5415011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.5415644Z 2025-05-07T20:04:09.5417398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.5420097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.5421291Z ^ 2025-05-07T20:04:09.5421662Z 2025-05-07T20:04:10.2189008Z [456/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:10.2212637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2215335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2216706Z ^ 2025-05-07T20:04:10.2216969Z 2025-05-07T20:04:10.2217402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.2218034Z 2025-05-07T20:04:10.2219706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2222350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2223570Z ^ 2025-05-07T20:04:10.2223939Z 2025-05-07T20:04:10.2225596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2228145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2229334Z ^ 2025-05-07T20:04:10.2229588Z 2025-05-07T20:04:10.2230041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.2230699Z 
2025-05-07T20:04:10.2232365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2235314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2236406Z ^ 2025-05-07T20:04:10.2236737Z 2025-05-07T20:04:10.2238379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2241101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2242276Z ^ 2025-05-07T20:04:10.2242507Z 2025-05-07T20:04:10.2242959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.2243565Z 2025-05-07T20:04:10.2245322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2248048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2249247Z ^ 2025-05-07T20:04:10.2249615Z 2025-05-07T20:04:10.2251280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2253812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:10.2254916Z ^ 2025-05-07T20:04:10.2255167Z 2025-05-07T20:04:10.2255609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.2256289Z 2025-05-07T20:04:10.2258036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2260637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2261823Z ^ 2025-05-07T20:04:10.2262188Z 2025-05-07T20:04:10.2263859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2266446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2267570Z ^ 2025-05-07T20:04:10.2267822Z 2025-05-07T20:04:10.2268284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:10.2268944Z 2025-05-07T20:04:10.2270808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:10.2273440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:10.2274592Z ^ 2025-05-07T20:04:10.2274961Z 2025-05-07T20:04:11.3801686Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 
2025-05-07T20:04:11.3821535Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:12.0240367Z [458/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:12.2724565Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:12.9219142Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:12.9239519Z [460/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:12.9261007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:12.9263515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9264482Z ^ 2025-05-07T20:04:12.9264727Z 2025-05-07T20:04:12.9265126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:12.9265776Z 2025-05-07T20:04:12.9267306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9269813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9271066Z ^ 2025-05-07T20:04:12.9271400Z 2025-05-07T20:04:12.9272911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9275249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9276320Z ^ 2025-05-07T20:04:12.9276532Z 2025-05-07T20:04:12.9276924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:12.9277520Z 2025-05-07T20:04:12.9278921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9281423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9282540Z ^ 2025-05-07T20:04:12.9282874Z 2025-05-07T20:04:12.9284404Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9286922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9288007Z ^ 2025-05-07T20:04:12.9288249Z 2025-05-07T20:04:12.9288663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:12.9289243Z 2025-05-07T20:04:12.9290647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9293434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9294561Z ^ 2025-05-07T20:04:12.9294908Z 2025-05-07T20:04:12.9296753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9299205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9300186Z ^ 2025-05-07T20:04:12.9300430Z 2025-05-07T20:04:12.9300853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:12.9301420Z 2025-05-07T20:04:12.9302999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9305483Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9306652Z ^ 2025-05-07T20:04:12.9307045Z 2025-05-07T20:04:12.9308581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9311242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9312304Z ^ 2025-05-07T20:04:12.9312528Z 2025-05-07T20:04:12.9312959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:12.9313527Z 2025-05-07T20:04:12.9315020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:12.9317437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:12.9318432Z ^ 2025-05-07T20:04:12.9318752Z 2025-05-07T20:04:13.3407642Z [461/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:13.3429389Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3431920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3433011Z ^ 2025-05-07T20:04:13.3433241Z 2025-05-07T20:04:13.3433650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.3434304Z 2025-05-07T20:04:13.3435857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3438371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3439464Z ^ 2025-05-07T20:04:13.3439788Z 2025-05-07T20:04:13.3441297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3443778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3444909Z ^ 2025-05-07T20:04:13.3445158Z 2025-05-07T20:04:13.3445574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.3446194Z 2025-05-07T20:04:13.3447690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3450191Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3451312Z ^ 2025-05-07T20:04:13.3451660Z 2025-05-07T20:04:13.3453189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3455649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3457131Z ^ 2025-05-07T20:04:13.3457388Z 2025-05-07T20:04:13.3457820Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.3458448Z 2025-05-07T20:04:13.3459956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3465557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3466742Z ^ 2025-05-07T20:04:13.3467086Z 2025-05-07T20:04:13.3468620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3471828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3472903Z ^ 2025-05-07T20:04:13.3473138Z 2025-05-07T20:04:13.3473547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.3474120Z 2025-05-07T20:04:13.3475617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3478081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3479135Z ^ 2025-05-07T20:04:13.3479497Z 2025-05-07T20:04:13.3481028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3483498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3484486Z ^ 2025-05-07T20:04:13.3484739Z 2025-05-07T20:04:13.3485152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.3485748Z 2025-05-07T20:04:13.3487306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.3489763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.3490848Z ^ 2025-05-07T20:04:13.3491188Z 2025-05-07T20:04:13.6721813Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:13.6739065Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.2934848Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:14.2948305Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.6094975Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 
2025-05-07T20:04:14.6109384Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:16.0369351Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:16.0388321Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:16.9084791Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:16.9103797Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:16.9676226Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:16.9694811Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.3830511Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:17.3850170Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.1094122Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:18.1111268Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.0613071Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:19.0630418Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.4603590Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:19.4621165Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.9122618Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:19.9140235Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:20.3101427Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:21.2853075Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:21.2867277Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:21.2882676Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:22.4263378Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:22.4280524Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:22.4491956Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma 
-fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:22.4508388Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:22.5465671Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:22.5481960Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:23.6471304Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:23.6489175Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:23.7502383Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:23.7521684Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:24.2141230Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:24.2159629Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:25.2806275Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:04:25.2824063Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.8894435Z [482/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:04:26.8917283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8919954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8921130Z ^ 2025-05-07T20:04:26.8921409Z 2025-05-07T20:04:26.8921838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.8922519Z 2025-05-07T20:04:26.8924146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8926840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8928048Z ^ 2025-05-07T20:04:26.8928415Z 2025-05-07T20:04:26.8930026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8932664Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8933837Z ^ 2025-05-07T20:04:26.8934078Z 2025-05-07T20:04:26.8934520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.8935176Z 2025-05-07T20:04:26.8937003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8939880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8941063Z ^ 2025-05-07T20:04:26.8941456Z 2025-05-07T20:04:26.8943136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8945750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8946906Z ^ 2025-05-07T20:04:26.8947198Z 2025-05-07T20:04:26.8947656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.8948274Z 2025-05-07T20:04:26.8949904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8952503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8953621Z ^ 2025-05-07T20:04:26.8953902Z 2025-05-07T20:04:26.8955365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8957870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8958981Z ^ 2025-05-07T20:04:26.8959212Z 2025-05-07T20:04:26.8959644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.8960229Z 2025-05-07T20:04:26.8961547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8964129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8965315Z ^ 2025-05-07T20:04:26.8965701Z 2025-05-07T20:04:26.8967323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8969920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.8971348Z ^ 2025-05-07T20:04:26.8971555Z 2025-05-07T20:04:26.8971957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.8972590Z 2025-05-07T20:04:26.8974139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.8976749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:26.8977913Z ^ 2025-05-07T20:04:26.8978259Z 2025-05-07T20:04:28.2474170Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:04:28.2491216Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.5475150Z [484/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:04:28.5497834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5500266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5501681Z ^ 2025-05-07T20:04:28.5501948Z 2025-05-07T20:04:28.5502397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:28.5503083Z 2025-05-07T20:04:28.5504727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5507312Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5508486Z ^ 2025-05-07T20:04:28.5508862Z 2025-05-07T20:04:28.5510388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5512995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5514133Z ^ 2025-05-07T20:04:28.5514388Z 2025-05-07T20:04:28.5514853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:28.5515502Z 2025-05-07T20:04:28.5517239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5519902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5521125Z ^ 2025-05-07T20:04:28.5521497Z 2025-05-07T20:04:28.5523179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5525719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5526905Z ^ 2025-05-07T20:04:28.5527165Z 2025-05-07T20:04:28.5527615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:28.5528305Z 2025-05-07T20:04:28.5529966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5532663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5534025Z ^ 2025-05-07T20:04:28.5534486Z 2025-05-07T20:04:28.5536148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5538962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5540032Z ^ 2025-05-07T20:04:28.5540302Z 2025-05-07T20:04:28.5540830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:28.5541493Z 2025-05-07T20:04:28.5543064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5545593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5546779Z ^ 2025-05-07T20:04:28.5547155Z 2025-05-07T20:04:28.5548703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5551304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5552440Z ^ 2025-05-07T20:04:28.5552678Z 2025-05-07T20:04:28.5553082Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:28.5553727Z 2025-05-07T20:04:28.5555260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:28.5557925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:28.5559127Z ^ 2025-05-07T20:04:28.5559490Z 2025-05-07T20:04:29.5880328Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:04:29.5898689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.0578608Z [486/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:04:30.0596528Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.1342136Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:04:30.1360755Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.3267746Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:04:30.3286113Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.5060776Z [489/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:04:31.5078305Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.5289655Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:04:31.5307828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.8510072Z [491/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:04:31.8532891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8535617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8536795Z ^ 2025-05-07T20:04:31.8537021Z 2025-05-07T20:04:31.8537431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.8538045Z 2025-05-07T20:04:31.8539712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8542320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8543416Z ^ 2025-05-07T20:04:31.8543731Z 2025-05-07T20:04:31.8545273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8547693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8548748Z ^ 2025-05-07T20:04:31.8548964Z 2025-05-07T20:04:31.8549392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.8550006Z 2025-05-07T20:04:31.8551540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8553932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8555317Z ^ 2025-05-07T20:04:31.8555656Z 2025-05-07T20:04:31.8557120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8559446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8560589Z ^ 2025-05-07T20:04:31.8560967Z 2025-05-07T20:04:31.8561414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.8562076Z 2025-05-07T20:04:31.8563756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8566476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8567655Z ^ 2025-05-07T20:04:31.8567982Z 2025-05-07T20:04:31.8569551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8572343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8573462Z ^ 2025-05-07T20:04:31.8573714Z 2025-05-07T20:04:31.8574150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.8574803Z 2025-05-07T20:04:31.8576491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8579070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8580212Z ^ 2025-05-07T20:04:31.8580586Z 2025-05-07T20:04:31.8582183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8584747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8585845Z ^ 2025-05-07T20:04:31.8586110Z 2025-05-07T20:04:31.8586548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.8587157Z 2025-05-07T20:04:31.8588794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.8591343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.8592541Z ^ 
2025-05-07T20:04:31.8592915Z 2025-05-07T20:04:32.9540045Z [492/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:04:32.9549573Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.2013516Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:04:33.2030540Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.3149735Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:04:33.3166709Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.6846083Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:33.6863833Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.4531108Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:04:34.4549723Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.7382300Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:34.7401578Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.1573964Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:04:36.1592517Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:38.1315045Z [499/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:38.1339453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1342108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1343284Z ^ 2025-05-07T20:04:38.1343673Z 2025-05-07T20:04:38.1344130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:38.1344808Z 2025-05-07T20:04:38.1346413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1349073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1350299Z ^ 2025-05-07T20:04:38.1350651Z 2025-05-07T20:04:38.1352248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1354904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1356108Z ^ 2025-05-07T20:04:38.1356371Z 2025-05-07T20:04:38.1356827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:38.1357537Z 2025-05-07T20:04:38.1359235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1361907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1363073Z ^ 2025-05-07T20:04:38.1363671Z 2025-05-07T20:04:38.1364976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:38.1366703Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:38.1367259Z ^ 2025-05-07T20:04:38.1367514Z 2025-05-07T20:04:38.1369244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1372168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:38.1373264Z ^ 2025-05-07T20:04:38.1373499Z 2025-05-07T20:04:38.1373932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:38.1374601Z 2025-05-07T20:04:38.1376560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1379270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1380396Z ^ 2025-05-07T20:04:38.1380769Z 2025-05-07T20:04:38.1382097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:38.1383765Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:38.1384297Z ^ 2025-05-07T20:04:38.1384545Z 2025-05-07T20:04:38.1386218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1388826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1389986Z ^ 2025-05-07T20:04:38.1390226Z 2025-05-07T20:04:38.1390687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:38.1391340Z 2025-05-07T20:04:38.1392962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1395577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1396750Z ^ 2025-05-07T20:04:38.1397144Z 2025-05-07T20:04:38.1398333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:38.1400050Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:38.1400559Z ^ 2025-05-07T20:04:38.1400818Z 2025-05-07T20:04:38.1402430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1405051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1406155Z ^ 2025-05-07T20:04:38.1406362Z 2025-05-07T20:04:38.1407043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:38.1407773Z 2025-05-07T20:04:38.1409401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:38.1412133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:38.1413302Z ^ 2025-05-07T20:04:38.1413773Z 2025-05-07T20:04:38.1415080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:38.1416962Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:38.1417511Z ^ 2025-05-07T20:04:38.1417757Z 2025-05-07T20:04:38.7096618Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:04:38.7114640Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:40.2067825Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:04:40.2090885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2093480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2094591Z ^ 2025-05-07T20:04:40.2094823Z 2025-05-07T20:04:40.2095242Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.2095866Z 2025-05-07T20:04:40.2097412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2099857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2101055Z ^ 2025-05-07T20:04:40.2101416Z 2025-05-07T20:04:40.2102996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2105521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2106629Z ^ 2025-05-07T20:04:40.2106889Z 2025-05-07T20:04:40.2107311Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.2107927Z 2025-05-07T20:04:40.2109458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2111927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2113025Z ^ 2025-05-07T20:04:40.2113726Z 2025-05-07T20:04:40.2115262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2117860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2119040Z ^ 2025-05-07T20:04:40.2119295Z 2025-05-07T20:04:40.2119856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.2120516Z 2025-05-07T20:04:40.2122029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2124614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:40.2125881Z ^ 2025-05-07T20:04:40.2126265Z 2025-05-07T20:04:40.2127789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2130195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2131318Z ^ 2025-05-07T20:04:40.2131591Z 2025-05-07T20:04:40.2132027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.2132685Z 2025-05-07T20:04:40.2134346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2136934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2137877Z ^ 2025-05-07T20:04:40.2138129Z 2025-05-07T20:04:40.2139351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.2141668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2142771Z ^ 2025-05-07T20:04:40.2142987Z 2025-05-07T20:04:40.2143390Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.2144040Z 2025-05-07T20:04:40.2145587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:40.2148140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.2149288Z ^ 2025-05-07T20:04:40.2149669Z 2025-05-07T20:04:40.7356590Z [502/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:04:40.7379662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7381868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7382941Z ^ 2025-05-07T20:04:40.7383189Z 2025-05-07T20:04:40.7383650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.7384257Z 2025-05-07T20:04:40.7385819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7388349Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7389502Z ^ 2025-05-07T20:04:40.7389854Z 2025-05-07T20:04:40.7391509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7394074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7395156Z ^ 2025-05-07T20:04:40.7395384Z 2025-05-07T20:04:40.7395817Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.7396446Z 2025-05-07T20:04:40.7398056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7401040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7402202Z ^ 2025-05-07T20:04:40.7402559Z 2025-05-07T20:04:40.7404199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7406854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7408018Z ^ 2025-05-07T20:04:40.7408288Z 2025-05-07T20:04:40.7408739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.7409329Z 2025-05-07T20:04:40.7411034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7413710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7414835Z ^ 2025-05-07T20:04:40.7415189Z 2025-05-07T20:04:40.7416918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7419534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7420716Z ^ 2025-05-07T20:04:40.7420971Z 2025-05-07T20:04:40.7421421Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:40.7422114Z 2025-05-07T20:04:40.7423796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7426463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7427643Z ^ 2025-05-07T20:04:40.7428022Z 2025-05-07T20:04:40.7429682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7432353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7433514Z ^ 2025-05-07T20:04:40.7433767Z 2025-05-07T20:04:40.7434207Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:04:40.7434877Z 2025-05-07T20:04:40.7436558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:40.7439212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:40.7440350Z ^ 2025-05-07T20:04:40.7440696Z 2025-05-07T20:04:45.3233233Z [503/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:04:45.3248300Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.7974618Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:45.7993454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.7995474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.7996370Z ^ 2025-05-07T20:04:45.7996655Z 2025-05-07T20:04:45.7997107Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.7997707Z 2025-05-07T20:04:45.7999058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8001037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8001927Z ^ 2025-05-07T20:04:45.8002204Z 2025-05-07T20:04:45.8003418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8005365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8006255Z ^ 2025-05-07T20:04:45.8006479Z 2025-05-07T20:04:45.8006831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.8007400Z 2025-05-07T20:04:45.8008697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8010959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8011883Z ^ 2025-05-07T20:04:45.8012170Z 2025-05-07T20:04:45.8013295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:45.8014754Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:45.8015168Z ^ 
2025-05-07T20:04:45.8015360Z 2025-05-07T20:04:45.8016843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8018985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8019838Z ^ 2025-05-07T20:04:45.8020029Z 2025-05-07T20:04:45.8020414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.8020968Z 2025-05-07T20:04:45.8022324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8024685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8025563Z ^ 2025-05-07T20:04:45.8025897Z 2025-05-07T20:04:45.8026974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:45.8028575Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:45.8028984Z ^ 2025-05-07T20:04:45.8029174Z 2025-05-07T20:04:45.8030528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8032737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8033800Z ^ 2025-05-07T20:04:45.8034003Z 2025-05-07T20:04:45.8034370Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.8034898Z 2025-05-07T20:04:45.8036252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8038511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8039484Z ^ 2025-05-07T20:04:45.8039867Z 2025-05-07T20:04:45.8040999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:45.8042393Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:45.8042798Z ^ 2025-05-07T20:04:45.8043028Z 2025-05-07T20:04:45.8044393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8046692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8047551Z ^ 2025-05-07T20:04:45.8047740Z 2025-05-07T20:04:45.8048079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.8048575Z 2025-05-07T20:04:45.8049856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.8052155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.8053109Z ^ 
2025-05-07T20:04:45.8053379Z 2025-05-07T20:04:45.8054401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:45.8056002Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:45.8056671Z ^ 2025-05-07T20:04:45.8056908Z 2025-05-07T20:04:46.1668724Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:46.1684614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1686398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1687181Z ^ 2025-05-07T20:04:46.1687360Z 2025-05-07T20:04:46.1687669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.1688130Z 2025-05-07T20:04:46.1689209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:46.1690964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1691794Z ^ 2025-05-07T20:04:46.1692048Z 2025-05-07T20:04:46.1693115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1694855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1695936Z ^ 2025-05-07T20:04:46.1696127Z 2025-05-07T20:04:46.1696530Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.1696968Z 2025-05-07T20:04:46.1698131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1700044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1700829Z ^ 2025-05-07T20:04:46.1701074Z 2025-05-07T20:04:46.1702139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1703974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1704764Z ^ 2025-05-07T20:04:46.1704945Z 2025-05-07T20:04:46.1705265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.1705708Z 2025-05-07T20:04:46.1706781Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1708513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1709301Z ^ 2025-05-07T20:04:46.1709546Z 2025-05-07T20:04:46.1710604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1712334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1713081Z ^ 2025-05-07T20:04:46.1713261Z 2025-05-07T20:04:46.1713547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.1713973Z 2025-05-07T20:04:46.1715066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1716767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1717545Z ^ 2025-05-07T20:04:46.1717785Z 2025-05-07T20:04:46.1718848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1720540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1721297Z ^ 
2025-05-07T20:04:46.1721463Z 2025-05-07T20:04:46.1721766Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:46.1722230Z 2025-05-07T20:04:46.1723288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:46.1725018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:46.1727445Z ^ 2025-05-07T20:04:46.1727698Z 2025-05-07T20:04:47.0320098Z [506/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:04:47.8487371Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:47.8509672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8512194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8513246Z ^ 2025-05-07T20:04:47.8513489Z 2025-05-07T20:04:47.8513904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.8514507Z 2025-05-07T20:04:47.8516079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T20:04:47.8518647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8519769Z ^ 2025-05-07T20:04:47.8520107Z 2025-05-07T20:04:47.8521639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8524079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8525148Z ^ 2025-05-07T20:04:47.8525377Z 2025-05-07T20:04:47.8525808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.8526673Z 2025-05-07T20:04:47.8528208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8530694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8531821Z ^ 2025-05-07T20:04:47.8532196Z 2025-05-07T20:04:47.8533880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8536484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8537549Z ^ 2025-05-07T20:04:47.8537816Z 2025-05-07T20:04:47.8538330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.8538944Z 2025-05-07T20:04:47.8540412Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8542776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8543873Z ^ 2025-05-07T20:04:47.8544209Z 2025-05-07T20:04:47.8545736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8548155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8549150Z ^ 2025-05-07T20:04:47.8549382Z 2025-05-07T20:04:47.8549793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.8550430Z 2025-05-07T20:04:47.8551932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8554358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8555418Z ^ 2025-05-07T20:04:47.8555749Z 2025-05-07T20:04:47.8557241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8559688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8560737Z ^ 
2025-05-07T20:04:47.8560972Z 2025-05-07T20:04:47.8561364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:47.8561974Z 2025-05-07T20:04:47.8563526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:47.8565947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:47.8567026Z ^ 2025-05-07T20:04:47.8567509Z 2025-05-07T20:04:49.2613634Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:04:49.2636634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2639355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2640526Z ^ 2025-05-07T20:04:49.2640789Z 2025-05-07T20:04:49.2641240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:49.2641914Z 2025-05-07T20:04:49.2643566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2646126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2647284Z ^ 2025-05-07T20:04:49.2647662Z 2025-05-07T20:04:49.2649235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2663628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2665084Z ^ 2025-05-07T20:04:49.2665339Z 2025-05-07T20:04:49.2665789Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:49.2666436Z 2025-05-07T20:04:49.2668123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2671165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2672354Z ^ 2025-05-07T20:04:49.2672717Z 2025-05-07T20:04:49.2674400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2676707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2677819Z ^ 2025-05-07T20:04:49.2678063Z 2025-05-07T20:04:49.2678520Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:04:49.2679176Z 2025-05-07T20:04:49.2680752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2683415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2684555Z ^ 2025-05-07T20:04:49.2684951Z 2025-05-07T20:04:49.2686569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2689209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2690367Z ^ 2025-05-07T20:04:49.2690630Z 2025-05-07T20:04:49.2691083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:49.2691756Z 2025-05-07T20:04:49.2693375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2696022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2697162Z ^ 2025-05-07T20:04:49.2697505Z 2025-05-07T20:04:49.2699028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2701543Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2702667Z ^ 2025-05-07T20:04:49.2702896Z 2025-05-07T20:04:49.2703328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:49.2703966Z 2025-05-07T20:04:49.2705484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:49.2708310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:49.2709464Z ^ 2025-05-07T20:04:49.2709831Z 2025-05-07T20:04:53.3968103Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:04:53.3992719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.3995578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.3996830Z ^ 2025-05-07T20:04:53.3997111Z 2025-05-07T20:04:53.3997580Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:53.3998286Z 2025-05-07T20:04:53.4000029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4002916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4004164Z ^ 2025-05-07T20:04:53.4004986Z 2025-05-07T20:04:53.4006720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4009560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4010767Z ^ 2025-05-07T20:04:53.4011053Z 2025-05-07T20:04:53.4011672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.4012385Z 2025-05-07T20:04:53.4014123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4017143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4018582Z ^ 2025-05-07T20:04:53.4018975Z 2025-05-07T20:04:53.4020692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4023545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4024819Z ^ 2025-05-07T20:04:53.4025087Z 2025-05-07T20:04:53.4025556Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.4026290Z 2025-05-07T20:04:53.4028044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4030923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4032190Z ^ 2025-05-07T20:04:53.4032595Z 2025-05-07T20:04:53.4034308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4037176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4038422Z ^ 2025-05-07T20:04:53.4038691Z 2025-05-07T20:04:53.4039172Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.4039867Z 2025-05-07T20:04:53.4041602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4044470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4045761Z ^ 2025-05-07T20:04:53.4046141Z 2025-05-07T20:04:53.4047860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4050590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4051799Z ^ 2025-05-07T20:04:53.4052070Z 2025-05-07T20:04:53.4052684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.4053405Z 2025-05-07T20:04:53.4055054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.4058000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.4059281Z ^ 2025-05-07T20:04:53.4059770Z 2025-05-07T20:04:59.1982386Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:04:59.2005159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2007807Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2008912Z ^ 2025-05-07T20:04:59.2009154Z 2025-05-07T20:04:59.2009598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:59.2010228Z 2025-05-07T20:04:59.2011836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2014793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2015981Z ^ 2025-05-07T20:04:59.2016475Z 2025-05-07T20:04:59.2018235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2020823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2022003Z ^ 2025-05-07T20:04:59.2022262Z 2025-05-07T20:04:59.2022683Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:59.2023352Z 2025-05-07T20:04:59.2025103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2027646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2028783Z ^ 2025-05-07T20:04:59.2029121Z 2025-05-07T20:04:59.2030727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2033267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2034457Z ^ 2025-05-07T20:04:59.2034730Z 2025-05-07T20:04:59.2035203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:59.2035858Z 2025-05-07T20:04:59.2037472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2040128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2041319Z ^ 2025-05-07T20:04:59.2041708Z 2025-05-07T20:04:59.2043261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2045887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2047000Z ^ 2025-05-07T20:04:59.2047252Z 2025-05-07T20:04:59.2047710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:59.2048338Z 2025-05-07T20:04:59.2049923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2052543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2053674Z ^ 
2025-05-07T20:04:59.2054018Z 2025-05-07T20:04:59.2055591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2058468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2059619Z ^ 2025-05-07T20:04:59.2059856Z 2025-05-07T20:04:59.2060274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:59.2060906Z 2025-05-07T20:04:59.2062670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:59.2065234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:59.2066376Z ^ 2025-05-07T20:04:59.2066726Z 2025-05-07T20:05:04.8040506Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:04.8062645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:05:04.8065190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8066298Z ^ 2025-05-07T20:05:04.8066791Z 2025-05-07T20:05:04.8067325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.8067944Z 2025-05-07T20:05:04.8069516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8072338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8073642Z ^ 2025-05-07T20:05:04.8073994Z 2025-05-07T20:05:04.8075552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8078052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8079203Z ^ 2025-05-07T20:05:04.8079471Z 2025-05-07T20:05:04.8079843Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.8080377Z 2025-05-07T20:05:04.8081975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8084499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8085672Z ^ 2025-05-07T20:05:04.8086027Z 2025-05-07T20:05:04.8087611Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8090146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8091262Z ^ 2025-05-07T20:05:04.8091497Z 2025-05-07T20:05:04.8091924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.8092544Z 2025-05-07T20:05:04.8094203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8096877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8098020Z ^ 2025-05-07T20:05:04.8098357Z 2025-05-07T20:05:04.8099879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8102462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8103595Z ^ 2025-05-07T20:05:04.8103863Z 2025-05-07T20:05:04.8104294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.8104926Z 2025-05-07T20:05:04.8106519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8109088Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8110577Z ^ 2025-05-07T20:05:04.8110944Z 2025-05-07T20:05:04.8112548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8115162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8116239Z ^ 2025-05-07T20:05:04.8116481Z 2025-05-07T20:05:04.8116962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.8117589Z 2025-05-07T20:05:04.8119060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.8121533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.8122639Z ^ 2025-05-07T20:05:04.8123006Z 2025-05-07T20:05:06.5924210Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:06.5945721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5948512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5949524Z ^ 2025-05-07T20:05:06.5949776Z 2025-05-07T20:05:06.5950216Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.5950850Z 2025-05-07T20:05:06.5952624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5955273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5956423Z ^ 2025-05-07T20:05:06.5956779Z 2025-05-07T20:05:06.5958522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5961128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5962269Z ^ 2025-05-07T20:05:06.5962516Z 2025-05-07T20:05:06.5962963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.5963624Z 2025-05-07T20:05:06.5965262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5967914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5969068Z ^ 
2025-05-07T20:05:06.5969433Z 2025-05-07T20:05:06.5971292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5973876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5975021Z ^ 2025-05-07T20:05:06.5975281Z 2025-05-07T20:05:06.5975732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.5976491Z 2025-05-07T20:05:06.5978086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5980435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5981516Z ^ 2025-05-07T20:05:06.5981841Z 2025-05-07T20:05:06.5983414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5985949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5987132Z ^ 2025-05-07T20:05:06.5987375Z 2025-05-07T20:05:06.5987812Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.5988494Z 2025-05-07T20:05:06.5990088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:06.5993062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.5994255Z ^ 2025-05-07T20:05:06.5994641Z 2025-05-07T20:05:06.5996308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.5999063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.6000259Z ^ 2025-05-07T20:05:06.6000524Z 2025-05-07T20:05:06.6000956Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.6001591Z 2025-05-07T20:05:06.6003310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:06.6005945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:06.6007132Z ^ 2025-05-07T20:05:06.6007490Z 2025-05-07T20:05:09.4545459Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 
2025-05-07T20:05:09.4568526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4571447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4572640Z ^ 2025-05-07T20:05:09.4572903Z 2025-05-07T20:05:09.4573625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.4574282Z 2025-05-07T20:05:09.4575890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4578873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4580084Z ^ 2025-05-07T20:05:09.4580501Z 2025-05-07T20:05:09.4582103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4584770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4585872Z ^ 2025-05-07T20:05:09.4586113Z 2025-05-07T20:05:09.4586543Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.4587185Z 2025-05-07T20:05:09.4588874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4591565Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4592709Z ^ 2025-05-07T20:05:09.4593060Z 2025-05-07T20:05:09.4594677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4597340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4598500Z ^ 2025-05-07T20:05:09.4598752Z 2025-05-07T20:05:09.4599204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.4599900Z 2025-05-07T20:05:09.4601478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4604271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4605428Z ^ 2025-05-07T20:05:09.4605801Z 2025-05-07T20:05:09.4607429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4610002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4611189Z ^ 2025-05-07T20:05:09.4611701Z 2025-05-07T20:05:09.4612170Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.4612828Z 2025-05-07T20:05:09.4614506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4617332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4618592Z ^ 2025-05-07T20:05:09.4618961Z 2025-05-07T20:05:09.4620555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4623061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4624321Z ^ 2025-05-07T20:05:09.4624604Z 2025-05-07T20:05:09.4625041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.4625688Z 2025-05-07T20:05:09.4627289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.4629989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.4631139Z ^ 2025-05-07T20:05:09.4631496Z 2025-05-07T20:05:09.8944463Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:09.8966548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.8969555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.8970988Z ^ 2025-05-07T20:05:09.8971234Z 2025-05-07T20:05:09.8971679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.8972323Z 2025-05-07T20:05:09.8974086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.8976865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.8978001Z ^ 2025-05-07T20:05:09.8978349Z 2025-05-07T20:05:09.8979974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.8982560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.8983683Z ^ 2025-05-07T20:05:09.8983955Z 2025-05-07T20:05:09.8984398Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.8985052Z 2025-05-07T20:05:09.8986682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:05:09.8989232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.8990428Z ^ 2025-05-07T20:05:09.8990797Z 2025-05-07T20:05:09.8992450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.8995011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.8996165Z ^ 2025-05-07T20:05:09.8996410Z 2025-05-07T20:05:09.8996835Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.8997498Z 2025-05-07T20:05:09.8999131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.9001747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.9002830Z ^ 2025-05-07T20:05:09.9003194Z 2025-05-07T20:05:09.9004764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.9007644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.9008724Z ^ 2025-05-07T20:05:09.9008963Z 2025-05-07T20:05:09.9009371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.9010007Z 2025-05-07T20:05:09.9011758Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.9014330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.9015517Z ^ 2025-05-07T20:05:09.9015884Z 2025-05-07T20:05:09.9017759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.9020337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.9021465Z ^ 2025-05-07T20:05:09.9021697Z 2025-05-07T20:05:09.9022156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:09.9022790Z 2025-05-07T20:05:09.9024165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:09.9026647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:09.9027729Z ^ 2025-05-07T20:05:09.9028078Z 2025-05-07T20:05:11.3532013Z [515/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:11.3551086Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:13.4320927Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:16.2947421Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:05:23.3425778Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:05:23.3444798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3447292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3448292Z ^ 2025-05-07T20:05:23.3448530Z 2025-05-07T20:05:23.3449179Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:05:23.3449847Z 2025-05-07T20:05:23.3451296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3453728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3454779Z ^ 2025-05-07T20:05:23.3455260Z 2025-05-07T20:05:23.3456831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3459114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3460186Z ^ 2025-05-07T20:05:23.3460522Z 2025-05-07T20:05:23.3460944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:23.3461625Z 2025-05-07T20:05:23.3463299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3465925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3466996Z ^ 2025-05-07T20:05:23.3467340Z 2025-05-07T20:05:23.3468840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3471650Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3472776Z ^ 2025-05-07T20:05:23.3472999Z 2025-05-07T20:05:23.3473360Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:23.3473865Z 2025-05-07T20:05:23.3475181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3477503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3478647Z ^ 2025-05-07T20:05:23.3478998Z 2025-05-07T20:05:23.3480591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3483211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3484396Z ^ 2025-05-07T20:05:23.3484645Z 2025-05-07T20:05:23.3485092Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:23.3485771Z 2025-05-07T20:05:23.3487433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3490111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3491500Z ^ 2025-05-07T20:05:23.3491988Z 2025-05-07T20:05:23.3493592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3496198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3497456Z ^ 2025-05-07T20:05:23.3497701Z 2025-05-07T20:05:23.3498310Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:23.3498980Z 2025-05-07T20:05:23.3500635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:23.3503399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:23.3504572Z ^ 2025-05-07T20:05:23.3504933Z 2025-05-07T20:05:25.3182597Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:05:34.3576659Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:05:35.9965560Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:05:35.9985874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9988076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9989121Z ^ 2025-05-07T20:05:35.9989375Z 2025-05-07T20:05:35.9989848Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:35.9990565Z 2025-05-07T20:05:35.9992062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9994435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:35.9995520Z ^ 2025-05-07T20:05:35.9995891Z 2025-05-07T20:05:35.9997310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:35.9999566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:05:36.0000619Z ^ 2025-05-07T20:05:36.0000852Z 2025-05-07T20:05:36.0001295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.0001953Z 2025-05-07T20:05:36.0003597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0005976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0006989Z ^ 2025-05-07T20:05:36.0007291Z 2025-05-07T20:05:36.0008513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0010777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0011796Z ^ 2025-05-07T20:05:36.0012027Z 2025-05-07T20:05:36.0012406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.0017445Z 2025-05-07T20:05:36.0018960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0021406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0022524Z ^ 2025-05-07T20:05:36.0022872Z 2025-05-07T20:05:36.0024429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:05:36.0027037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0028138Z ^ 2025-05-07T20:05:36.0028442Z 2025-05-07T20:05:36.0028870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.0029522Z 2025-05-07T20:05:36.0031032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0033391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0034635Z ^ 2025-05-07T20:05:36.0035067Z 2025-05-07T20:05:36.0036702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0039030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0040072Z ^ 2025-05-07T20:05:36.0040340Z 2025-05-07T20:05:36.0040752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.0041325Z 2025-05-07T20:05:36.0042724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.0045065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.0046137Z ^ 2025-05-07T20:05:36.0046457Z 2025-05-07T20:05:36.8491794Z [522/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:05:38.0836317Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:05:38.0856589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0859434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0860440Z ^ 2025-05-07T20:05:38.0860699Z 2025-05-07T20:05:38.0861104Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:38.0861694Z 2025-05-07T20:05:38.0863145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:38.0865769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0866877Z ^ 2025-05-07T20:05:38.0867218Z 2025-05-07T20:05:38.0868907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0871440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0872443Z ^ 2025-05-07T20:05:38.0872680Z 2025-05-07T20:05:38.0873067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:38.0873673Z 2025-05-07T20:05:38.0875315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0877758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0878893Z ^ 2025-05-07T20:05:38.0879246Z 2025-05-07T20:05:38.0880756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0883123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0884140Z ^ 2025-05-07T20:05:38.0884391Z 2025-05-07T20:05:38.0884793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:38.0885273Z 2025-05-07T20:05:38.0886483Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0888833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0889855Z ^ 2025-05-07T20:05:38.0890171Z 2025-05-07T20:05:38.0891543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0893872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0894921Z ^ 2025-05-07T20:05:38.0895148Z 2025-05-07T20:05:38.0895554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:38.0896178Z 2025-05-07T20:05:38.0897689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0900302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0901343Z ^ 2025-05-07T20:05:38.0901671Z 2025-05-07T20:05:38.0903100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0905689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0906758Z ^ 
2025-05-07T20:05:38.0906993Z 2025-05-07T20:05:38.0907430Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:05:38.0908072Z 2025-05-07T20:05:38.0909648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:38.0911991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:38.0913041Z ^ 2025-05-07T20:05:38.0913409Z 2025-05-07T20:05:43.7414725Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:05:43.8297247Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:05:46.1516288Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:05:46.8780092Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:05:47.0769286Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:05:47.7167598Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 
2025-05-07T20:05:48.7894768Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:05:49.2001087Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:05:49.5928713Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:05:49.9541813Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:05:50.3506839Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:05:50.3609442Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:05:50.3611570Z ################################################################################ 2025-05-07T20:05:50.3612433Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.3613311Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:05:50.3614231Z Removing all RPATHs ... 2025-05-07T20:05:50.3614708Z ################################################################################ 2025-05-07T20:05:50.3732653Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 1 2025-05-07T20:05:50.3735132Z ################################################################################ 2025-05-07T20:05:50.3735755Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.3736779Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:05:50.3737719Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:05:50.3738383Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:50.3739268Z ################################################################################ 2025-05-07T20:05:50.5113373Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:05:50.5115797Z ################################################################################ 2025-05-07T20:05:50.5116448Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.5117427Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:05:50.5118527Z Removing all RPATHs ... 2025-05-07T20:05:50.5119027Z ################################################################################ 2025-05-07T20:05:50.8901402Z [538/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:05:50.8903726Z ################################################################################ 2025-05-07T20:05:50.8904375Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.8905434Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:05:50.8906504Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:05:50.8907187Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:50.8907927Z ################################################################################ 2025-05-07T20:05:50.8980009Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:05:50.8982241Z ################################################################################ 2025-05-07T20:05:50.8982857Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.8983870Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:05:50.8984912Z Removing all RPATHs ... 2025-05-07T20:05:50.8985379Z ################################################################################ 2025-05-07T20:05:50.9386197Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:05:50.9388511Z ################################################################################ 2025-05-07T20:05:50.9389140Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.9390186Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:05:50.9391432Z Removing all RPATHs ... 
2025-05-07T20:05:50.9391924Z ################################################################################ 2025-05-07T20:05:50.9481340Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:05:50.9483877Z ################################################################################ 2025-05-07T20:05:50.9484468Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.9485491Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:05:50.9486521Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:05:50.9487123Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:50.9488069Z ################################################################################ 2025-05-07T20:05:50.9584354Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:05:50.9586458Z ################################################################################ 2025-05-07T20:05:50.9587088Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.9587991Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:05:50.9588923Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:05:50.9589494Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:50.9590172Z ################################################################################ 2025-05-07T20:05:50.9828989Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:05:50.9831079Z ################################################################################ 2025-05-07T20:05:50.9831657Z [CMAKE] Running post-build script ... 2025-05-07T20:05:50.9832638Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:05:50.9833520Z Removing all RPATHs ... 2025-05-07T20:05:50.9833981Z ################################################################################ 2025-05-07T20:05:51.1257804Z [544/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:05:51.1277527Z In file included from tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:1: 2025-05-07T20:05:51.1279449Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1288398Z static void 
__device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:05:51.1297044Z ^ 2025-05-07T20:05:51.1299627Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1302431Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1305577Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1313558Z static void 
__device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:05:51.1319970Z ^ 2025-05-07T20:05:51.1321762Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1324654Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1327454Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1330175Z /tmp/tmpxft_00004282_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:05:51.1331892Z 8 warnings generated. 
2025-05-07T20:05:51.1433218Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:05:51.1435775Z ################################################################################ 2025-05-07T20:05:51.1436367Z [CMAKE] Running post-build script ... 2025-05-07T20:05:51.1437415Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:05:51.1438475Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:05:51.1439072Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:51.1439736Z ################################################################################ 2025-05-07T20:05:51.1551343Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:05:51.1553680Z ################################################################################ 2025-05-07T20:05:51.1554278Z [CMAKE] Running post-build script ... 2025-05-07T20:05:51.1555542Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:05:51.1556663Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:05:51.1557258Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:51.1557957Z ################################################################################ 2025-05-07T20:05:51.3293631Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:05:51.3295975Z ################################################################################ 2025-05-07T20:05:51.3296685Z [CMAKE] Running post-build script ... 2025-05-07T20:05:51.3297766Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:05:51.3298822Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:05:51.3299435Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:05:51.3300049Z ################################################################################ 2025-05-07T20:05:52.9099969Z [548/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:52.9119763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:05:52.9121890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9122734Z ^ 2025-05-07T20:05:52.9122964Z 2025-05-07T20:05:52.9123309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:52.9123805Z 2025-05-07T20:05:52.9125070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9126983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9127853Z ^ 2025-05-07T20:05:52.9128137Z 2025-05-07T20:05:52.9129299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9131234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9132099Z ^ 2025-05-07T20:05:52.9132285Z 2025-05-07T20:05:52.9132610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:52.9133092Z 2025-05-07T20:05:52.9134253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9136135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9137137Z ^ 2025-05-07T20:05:52.9137408Z 2025-05-07T20:05:52.9138613Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9140535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9141402Z ^ 2025-05-07T20:05:52.9141592Z 2025-05-07T20:05:52.9141947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:52.9142443Z 2025-05-07T20:05:52.9143647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9145719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9146628Z ^ 2025-05-07T20:05:52.9146896Z 2025-05-07T20:05:52.9148066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9150037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9150855Z ^ 2025-05-07T20:05:52.9151061Z 2025-05-07T20:05:52.9151380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:52.9151865Z 2025-05-07T20:05:52.9153180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9155156Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9156010Z ^ 2025-05-07T20:05:52.9156275Z 2025-05-07T20:05:52.9157607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9159534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9160414Z ^ 2025-05-07T20:05:52.9160596Z 2025-05-07T20:05:52.9160932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:52.9161413Z 2025-05-07T20:05:52.9162662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:52.9164594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:52.9165491Z ^ 2025-05-07T20:05:52.9165827Z 2025-05-07T20:05:53.5766858Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:05:53.5796779Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5800489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5802088Z ^ 2025-05-07T20:05:53.5802408Z 2025-05-07T20:05:53.5802995Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5803849Z 2025-05-07T20:05:53.5805965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5809213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5810635Z ^ 2025-05-07T20:05:53.5811065Z 2025-05-07T20:05:53.5813067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5816366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5817987Z ^ 2025-05-07T20:05:53.5818307Z 2025-05-07T20:05:53.5818864Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5819705Z 2025-05-07T20:05:53.5821938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5825433Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5826971Z ^ 2025-05-07T20:05:53.5827405Z 2025-05-07T20:05:53.5829307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5832507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5834134Z ^ 2025-05-07T20:05:53.5834438Z 2025-05-07T20:05:53.5834989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5835799Z 2025-05-07T20:05:53.5837770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5841320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5842757Z ^ 2025-05-07T20:05:53.5843184Z 2025-05-07T20:05:53.5845184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5848655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5850097Z ^ 2025-05-07T20:05:53.5850388Z 2025-05-07T20:05:53.5850929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5851690Z 2025-05-07T20:05:53.5853791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5857161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5858576Z ^ 2025-05-07T20:05:53.5859017Z 2025-05-07T20:05:53.5860946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5864151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5865535Z ^ 2025-05-07T20:05:53.5865842Z 2025-05-07T20:05:53.5866356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5867144Z 2025-05-07T20:05:53.5869230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5872361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5873385Z ^ 2025-05-07T20:05:53.5873701Z 2025-05-07T20:05:54.0994709Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:05:54.8357413Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:54.8375574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8377645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8378743Z ^ 2025-05-07T20:05:54.8378972Z 2025-05-07T20:05:54.8379297Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:54.8379782Z 2025-05-07T20:05:54.8381019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8383129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8384005Z ^ 2025-05-07T20:05:54.8384282Z 2025-05-07T20:05:54.8385482Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8387485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8388356Z ^ 2025-05-07T20:05:54.8388550Z 2025-05-07T20:05:54.8388865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:54.8389354Z 2025-05-07T20:05:54.8390520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8392407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8393247Z ^ 2025-05-07T20:05:54.8393523Z 2025-05-07T20:05:54.8394662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8396537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8397373Z ^ 2025-05-07T20:05:54.8397565Z 2025-05-07T20:05:54.8397897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:54.8398364Z 2025-05-07T20:05:54.8399562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8401478Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8402362Z ^ 2025-05-07T20:05:54.8402639Z 2025-05-07T20:05:54.8403797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8405763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8406819Z ^ 2025-05-07T20:05:54.8407008Z 2025-05-07T20:05:54.8407356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:54.8407862Z 2025-05-07T20:05:54.8409071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8411287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8412208Z ^ 2025-05-07T20:05:54.8412500Z 2025-05-07T20:05:54.8413823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8416382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8417646Z ^ 2025-05-07T20:05:54.8417887Z 2025-05-07T20:05:54.8418264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:54.8418960Z 2025-05-07T20:05:54.8420702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:54.8422948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:54.8423831Z ^ 2025-05-07T20:05:54.8424155Z 2025-05-07T20:05:54.9014109Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:05:55.0942497Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:05:55.4799825Z [554/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared 
-Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:05:55.4894887Z [555/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:05:55.4897564Z ################################################################################ 2025-05-07T20:05:55.4898230Z [CMAKE] Running post-build script ... 2025-05-07T20:05:55.4899316Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:05:55.4900372Z Removing all RPATHs ... 2025-05-07T20:05:55.4900883Z ################################################################################ 2025-05-07T20:05:55.9114940Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:05:56.0240014Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:05:56.4148349Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:05:58.1011937Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:05:58.4110715Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:05:58.4121479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4122287Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:05:58.4122690Z ^ 2025-05-07T20:05:58.4122822Z 2025-05-07T20:05:58.4123057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:58.4123415Z 
2025-05-07T20:05:58.4123895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4124653Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:05:58.4125025Z ^ 2025-05-07T20:05:58.4125157Z 2025-05-07T20:05:58.4125656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4126447Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:05:58.4126848Z ^ 2025-05-07T20:05:58.4126984Z 2025-05-07T20:05:58.4127470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4128326Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:05:58.4128711Z ^ 2025-05-07T20:05:58.4128837Z 2025-05-07T20:05:58.4129316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4130189Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:05:58.4130585Z ^ 2025-05-07T20:05:58.4130715Z 2025-05-07T20:05:58.4130947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:58.4131300Z 2025-05-07T20:05:58.4131779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4132541Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:05:58.4132958Z ^ 2025-05-07T20:05:58.4133084Z 2025-05-07T20:05:58.4133585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4134367Z 
__attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:05:58.4134762Z ^ 2025-05-07T20:05:58.4134886Z 2025-05-07T20:05:58.4135403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4136193Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:05:58.4136721Z ^ 2025-05-07T20:05:58.4136844Z 2025-05-07T20:05:58.4137372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4138159Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:05:58.4138583Z ^ 2025-05-07T20:05:58.4138719Z 2025-05-07T20:05:58.4138952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:58.4139291Z 2025-05-07T20:05:58.4139778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4140545Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:05:58.4140922Z ^ 2025-05-07T20:05:58.4141047Z 2025-05-07T20:05:58.4141533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4142322Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:05:58.4142711Z ^ 2025-05-07T20:05:58.4142838Z 2025-05-07T20:05:58.4143326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4144116Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:05:58.4144493Z ^ 2025-05-07T20:05:58.4144631Z 2025-05-07T20:05:58.4145106Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4145893Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:05:58.4146268Z ^ 2025-05-07T20:05:58.4146392Z 2025-05-07T20:05:58.4146633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:58.4146972Z 2025-05-07T20:05:58.4147451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4148222Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:05:58.4148600Z ^ 2025-05-07T20:05:58.4148727Z 2025-05-07T20:05:58.4149211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4150003Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:05:58.4152575Z ^ 2025-05-07T20:05:58.4152717Z 2025-05-07T20:05:58.4153214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4154013Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:05:58.4154389Z ^ 2025-05-07T20:05:58.4154517Z 2025-05-07T20:05:58.4166296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4167499Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:05:58.4167910Z ^ 2025-05-07T20:05:58.4168060Z 2025-05-07T20:05:58.4168341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:58.4168700Z 2025-05-07T20:05:58.4169195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: 
inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4170015Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:05:58.4170884Z ^ 2025-05-07T20:05:58.4171030Z 2025-05-07T20:05:58.4171530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4172365Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:05:58.4172768Z ^ 2025-05-07T20:05:58.4172934Z 2025-05-07T20:05:58.4173500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:05:58.4174327Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:05:58.4174730Z ^ 2025-05-07T20:05:58.4174864Z 2025-05-07T20:06:00.3634320Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:01.2630537Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:01.2650930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2653391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2654510Z ^ 2025-05-07T20:06:01.2654747Z 2025-05-07T20:06:01.2655162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.2655794Z 2025-05-07T20:06:01.2657432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2659687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2660749Z ^ 2025-05-07T20:06:01.2661347Z 2025-05-07T20:06:01.2662760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2664682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2665698Z ^ 2025-05-07T20:06:01.2665922Z 2025-05-07T20:06:01.2666247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.2666847Z 2025-05-07T20:06:01.2668089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2670625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2671659Z ^ 2025-05-07T20:06:01.2672176Z 2025-05-07T20:06:01.2673618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2675712Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2676734Z ^ 2025-05-07T20:06:01.2677120Z 2025-05-07T20:06:01.2677509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.2678119Z 2025-05-07T20:06:01.2679611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2682060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2683161Z ^ 2025-05-07T20:06:01.2683507Z 2025-05-07T20:06:01.2684904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2687342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2688401Z ^ 2025-05-07T20:06:01.2688634Z 2025-05-07T20:06:01.2689041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.2689651Z 2025-05-07T20:06:01.2691114Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2693516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2694644Z ^ 2025-05-07T20:06:01.2694976Z 2025-05-07T20:06:01.2696584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2698978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2700000Z ^ 2025-05-07T20:06:01.2700456Z 2025-05-07T20:06:01.2700857Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.2701454Z 2025-05-07T20:06:01.2702892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:01.2705323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:01.2706431Z ^ 2025-05-07T20:06:01.2706989Z 2025-05-07T20:06:01.7404681Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:01.7424174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7425750Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:01.7426459Z ^ 2025-05-07T20:06:01.7426700Z 2025-05-07T20:06:01.7427116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.7427736Z 2025-05-07T20:06:01.7428685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7430123Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:01.7430741Z ^ 2025-05-07T20:06:01.7430956Z 2025-05-07T20:06:01.7431834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7433480Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:01.7434167Z ^ 2025-05-07T20:06:01.7434397Z 2025-05-07T20:06:01.7435300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7436561Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:01.7437167Z ^ 2025-05-07T20:06:01.7437561Z 2025-05-07T20:06:01.7438458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7439697Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:01.7440314Z ^ 2025-05-07T20:06:01.7440487Z 2025-05-07T20:06:01.7441201Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7442492Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:01.7443053Z ^ 2025-05-07T20:06:01.7443258Z 2025-05-07T20:06:01.7443626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.7444153Z 2025-05-07T20:06:01.7444959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7446221Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:01.7446805Z ^ 2025-05-07T20:06:01.7447003Z 2025-05-07T20:06:01.7447771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7448855Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:01.7449492Z ^ 2025-05-07T20:06:01.7449704Z 2025-05-07T20:06:01.7450619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7451816Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:01.7452587Z ^ 2025-05-07T20:06:01.7452794Z 2025-05-07T20:06:01.7453629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7455140Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:01.7455905Z ^ 2025-05-07T20:06:01.7456132Z 2025-05-07T20:06:01.7457067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" 
function 2025-05-07T20:06:01.7458451Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:01.7459162Z ^ 2025-05-07T20:06:01.7459342Z 2025-05-07T20:06:01.7459712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.7460266Z 2025-05-07T20:06:01.7461120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7462486Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:01.7463054Z ^ 2025-05-07T20:06:01.7463280Z 2025-05-07T20:06:01.7463984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7465402Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:01.7466130Z ^ 2025-05-07T20:06:01.7466340Z 2025-05-07T20:06:01.7467274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7468648Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:01.7469276Z ^ 2025-05-07T20:06:01.7469481Z 2025-05-07T20:06:01.7470571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7472244Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:01.7472942Z ^ 2025-05-07T20:06:01.7473154Z 2025-05-07T20:06:01.7474030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7475522Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:01.7476195Z ^ 
2025-05-07T20:06:01.7476361Z 2025-05-07T20:06:01.7476847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.7477309Z 2025-05-07T20:06:01.7478166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7479541Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:01.7480178Z ^ 2025-05-07T20:06:01.7480396Z 2025-05-07T20:06:01.7481423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7482841Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:01.7483536Z ^ 2025-05-07T20:06:01.7483762Z 2025-05-07T20:06:01.7484682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7486137Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:01.7486853Z ^ 2025-05-07T20:06:01.7487074Z 2025-05-07T20:06:01.7488006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7489487Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:01.7490254Z ^ 2025-05-07T20:06:01.7490492Z 2025-05-07T20:06:01.7491342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7492794Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:01.7493610Z ^ 2025-05-07T20:06:01.7493875Z 2025-05-07T20:06:01.7494272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:01.7494932Z 
2025-05-07T20:06:01.7495799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7497355Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:01.7497986Z ^ 2025-05-07T20:06:01.7498205Z 2025-05-07T20:06:01.7499109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7500276Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:01.7500886Z ^ 2025-05-07T20:06:01.7501118Z 2025-05-07T20:06:01.7501973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7503630Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:01.7504321Z ^ 2025-05-07T20:06:01.7504555Z 2025-05-07T20:06:01.7505448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:01.7506901Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:01.7507783Z ^ 2025-05-07T20:06:01.7507980Z 2025-05-07T20:06:01.8718640Z [564/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:01.9103521Z [565/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:01.9105627Z ################################################################################ 2025-05-07T20:06:01.9106557Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.9107526Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:01.9108416Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:01.9108945Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:01.9109577Z ################################################################################ 2025-05-07T20:06:04.1477312Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:04.6683080Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:04.6702740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6704318Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:04.6704985Z ^ 2025-05-07T20:06:04.6705211Z 2025-05-07T20:06:04.6705769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6706412Z 2025-05-07T20:06:04.6707351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" 
function 2025-05-07T20:06:04.6708880Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:04.6709623Z ^ 2025-05-07T20:06:04.6709866Z 2025-05-07T20:06:04.6710824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6712274Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:04.6713005Z ^ 2025-05-07T20:06:04.6713220Z 2025-05-07T20:06:04.6713631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6714209Z 2025-05-07T20:06:04.6715141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6716671Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:04.6717357Z ^ 2025-05-07T20:06:04.6717583Z 2025-05-07T20:06:04.6718522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6720060Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:04.6720790Z ^ 2025-05-07T20:06:04.6721017Z 2025-05-07T20:06:04.6721429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6721993Z 2025-05-07T20:06:04.6722857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6724329Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:04.6725031Z ^ 2025-05-07T20:06:04.6725239Z 2025-05-07T20:06:04.6726144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:04.6727690Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:04.6728523Z ^ 2025-05-07T20:06:04.6728756Z 2025-05-07T20:06:04.6729179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6729757Z 2025-05-07T20:06:04.6730651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6732055Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:04.6732700Z ^ 2025-05-07T20:06:04.6732970Z 2025-05-07T20:06:04.6733829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6735225Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:04.6735821Z ^ 2025-05-07T20:06:04.6736034Z 2025-05-07T20:06:04.6736508Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6737097Z 2025-05-07T20:06:04.6738049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:04.6739482Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:04.6740171Z ^ 2025-05-07T20:06:04.6740369Z 2025-05-07T20:06:11.5745615Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:16.3825112Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:06:16.9262994Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:16.9278557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9279955Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:16.9280927Z ^ 2025-05-07T20:06:16.9281217Z 2025-05-07T20:06:16.9281751Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9282264Z 2025-05-07T20:06:16.9283278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9284445Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:16.9284928Z ^ 2025-05-07T20:06:16.9285103Z 2025-05-07T20:06:16.9285775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9287042Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 
2025-05-07T20:06:16.9287711Z ^ 2025-05-07T20:06:16.9287876Z 2025-05-07T20:06:16.9288587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9289722Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:16.9290280Z ^ 2025-05-07T20:06:16.9290441Z 2025-05-07T20:06:16.9291185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9292317Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:16.9292891Z ^ 2025-05-07T20:06:16.9293061Z 2025-05-07T20:06:16.9293746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9294934Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:16.9295472Z ^ 2025-05-07T20:06:16.9295634Z 2025-05-07T20:06:16.9295930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9296461Z 2025-05-07T20:06:16.9297139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9298173Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:16.9298640Z ^ 2025-05-07T20:06:16.9298804Z 2025-05-07T20:06:16.9299502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9300597Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:16.9301212Z ^ 2025-05-07T20:06:16.9301385Z 2025-05-07T20:06:16.9302101Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9303257Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:16.9303940Z ^ 2025-05-07T20:06:16.9304125Z 2025-05-07T20:06:16.9304826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9305972Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:16.9306512Z ^ 2025-05-07T20:06:16.9306687Z 2025-05-07T20:06:16.9307378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9308654Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:16.9309265Z ^ 2025-05-07T20:06:16.9309451Z 2025-05-07T20:06:16.9309798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9310325Z 2025-05-07T20:06:16.9311109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9312239Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:16.9312809Z ^ 2025-05-07T20:06:16.9313014Z 2025-05-07T20:06:16.9313772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9315057Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:16.9315696Z ^ 2025-05-07T20:06:16.9315901Z 2025-05-07T20:06:16.9316760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:16.9317995Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:16.9318631Z ^ 2025-05-07T20:06:16.9318824Z 2025-05-07T20:06:16.9319662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9320803Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:16.9321365Z ^ 2025-05-07T20:06:16.9321536Z 2025-05-07T20:06:16.9322193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9323296Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:16.9323826Z ^ 2025-05-07T20:06:16.9323995Z 2025-05-07T20:06:16.9324293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9324765Z 2025-05-07T20:06:16.9325442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9326487Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:16.9326974Z ^ 2025-05-07T20:06:16.9327135Z 2025-05-07T20:06:16.9327818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9328912Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:16.9329436Z ^ 2025-05-07T20:06:16.9329598Z 2025-05-07T20:06:16.9330270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9331360Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:16.9331891Z ^ 
2025-05-07T20:06:16.9332045Z 2025-05-07T20:06:16.9332689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9333876Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:16.9334397Z ^ 2025-05-07T20:06:16.9334574Z 2025-05-07T20:06:16.9335233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9336406Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:16.9336937Z ^ 2025-05-07T20:06:16.9337195Z 2025-05-07T20:06:16.9337506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9337961Z 2025-05-07T20:06:16.9338613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9339653Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:16.9340123Z ^ 2025-05-07T20:06:16.9340284Z 2025-05-07T20:06:16.9341022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9342095Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:16.9342583Z ^ 2025-05-07T20:06:16.9342754Z 2025-05-07T20:06:16.9343444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9344665Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:16.9345206Z ^ 2025-05-07T20:06:16.9345362Z 2025-05-07T20:06:16.9346057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning 
#20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:16.9347196Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:16.9347742Z ^ 2025-05-07T20:06:16.9347927Z 2025-05-07T20:06:17.8816279Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:06:19.0574610Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:06:19.0589680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.0590863Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:19.0591465Z ^ 2025-05-07T20:06:19.0591647Z 2025-05-07T20:06:19.0591973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.0592476Z 2025-05-07T20:06:19.0593128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for 
"__global__" function 2025-05-07T20:06:19.0594234Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:19.0594833Z ^ 2025-05-07T20:06:19.0595019Z 2025-05-07T20:06:19.0595372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.0595847Z 2025-05-07T20:06:19.0596486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.0597834Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:19.0598418Z ^ 2025-05-07T20:06:19.0598639Z 2025-05-07T20:06:19.0598963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.0599440Z 2025-05-07T20:06:19.0600099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.0601196Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:19.0601969Z ^ 2025-05-07T20:06:19.0602151Z 2025-05-07T20:06:19.0602473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.0602976Z 2025-05-07T20:06:19.0603617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:19.0604750Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:19.0605556Z ^ 2025-05-07T20:06:19.0605757Z 2025-05-07T20:06:19.0606078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:19.0606564Z 2025-05-07T20:06:24.8266563Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:06:28.2172567Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:06:28.2193150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2195689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2196820Z ^ 2025-05-07T20:06:28.2197089Z 2025-05-07T20:06:28.2197551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.2198157Z 2025-05-07T20:06:28.2199662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2202221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2203383Z ^ 2025-05-07T20:06:28.2203735Z 2025-05-07T20:06:28.2205274Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2207761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2208911Z ^ 2025-05-07T20:06:28.2209169Z 2025-05-07T20:06:28.2209600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.2210260Z 2025-05-07T20:06:28.2212030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2214685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2215820Z ^ 2025-05-07T20:06:28.2216191Z 2025-05-07T20:06:28.2217790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2220410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2221514Z ^ 2025-05-07T20:06:28.2221794Z 2025-05-07T20:06:28.2222254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.2222884Z 2025-05-07T20:06:28.2224549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2226999Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2228088Z ^ 2025-05-07T20:06:28.2228464Z 2025-05-07T20:06:28.2232532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2235232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2236360Z ^ 2025-05-07T20:06:28.2236609Z 2025-05-07T20:06:28.2237045Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.2237712Z 2025-05-07T20:06:28.2239266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2241836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2242970Z ^ 2025-05-07T20:06:28.2243333Z 2025-05-07T20:06:28.2244853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2247291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2248389Z ^ 2025-05-07T20:06:28.2248627Z 2025-05-07T20:06:28.2249080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.2249688Z 2025-05-07T20:06:28.2251185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:28.2253693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:28.2254808Z ^ 2025-05-07T20:06:28.2255152Z 2025-05-07T20:06:29.1778071Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:06:29.1789254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1790675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1791272Z ^ 2025-05-07T20:06:29.1791411Z 2025-05-07T20:06:29.1791661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.1792000Z 2025-05-07T20:06:29.1792834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1794180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1794793Z ^ 2025-05-07T20:06:29.1794982Z 
2025-05-07T20:06:29.1795804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1797150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1797750Z ^ 2025-05-07T20:06:29.1797890Z 2025-05-07T20:06:29.1798190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.1798534Z 2025-05-07T20:06:29.1799366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1800702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1801317Z ^ 2025-05-07T20:06:29.1801543Z 2025-05-07T20:06:29.1802061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:29.1802819Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:29.1803136Z ^ 2025-05-07T20:06:29.1803319Z 2025-05-07T20:06:29.1804179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1805511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1806100Z ^ 2025-05-07T20:06:29.1806251Z 2025-05-07T20:06:29.1806482Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:06:29.1806828Z 2025-05-07T20:06:29.1807716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1809049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1809667Z ^ 2025-05-07T20:06:29.1809861Z 2025-05-07T20:06:29.1810379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:29.1811130Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:29.1811453Z ^ 2025-05-07T20:06:29.1811616Z 2025-05-07T20:06:29.1812442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1813774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1814369Z ^ 2025-05-07T20:06:29.1814520Z 2025-05-07T20:06:29.1814753Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.1815095Z 2025-05-07T20:06:29.1815934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1817353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1817961Z ^ 2025-05-07T20:06:29.1818155Z 2025-05-07T20:06:29.1818683Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:29.1819418Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:29.1819735Z ^ 2025-05-07T20:06:29.1819900Z 2025-05-07T20:06:29.1820731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1822134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1822734Z ^ 2025-05-07T20:06:29.1822868Z 2025-05-07T20:06:29.1823098Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.1823448Z 2025-05-07T20:06:29.1824310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:29.1825648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:29.1826250Z ^ 2025-05-07T20:06:29.1826452Z 2025-05-07T20:06:29.1827005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:29.1827757Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:29.1828076Z ^ 2025-05-07T20:06:29.1828240Z 2025-05-07T20:06:30.3900048Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:06:30.3910997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3912475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3913107Z ^ 2025-05-07T20:06:30.3913251Z 2025-05-07T20:06:30.3913515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.3913860Z 2025-05-07T20:06:30.3914697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3916132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3916734Z ^ 2025-05-07T20:06:30.3916946Z 2025-05-07T20:06:30.3917892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3919235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3919846Z ^ 2025-05-07T20:06:30.3919983Z 2025-05-07T20:06:30.3920212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.3920555Z 2025-05-07T20:06:30.3921437Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3922758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3923377Z ^ 2025-05-07T20:06:30.3923567Z 2025-05-07T20:06:30.3924398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3925720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3926319Z ^ 2025-05-07T20:06:30.3926453Z 2025-05-07T20:06:30.3926698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.3927037Z 2025-05-07T20:06:30.3927865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3929209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3929807Z ^ 2025-05-07T20:06:30.3930012Z 2025-05-07T20:06:30.3930827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3932164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3932752Z ^ 
2025-05-07T20:06:30.3932903Z 2025-05-07T20:06:30.3933129Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.3933467Z 2025-05-07T20:06:30.3934312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3935706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3936325Z ^ 2025-05-07T20:06:30.3936613Z 2025-05-07T20:06:30.3937448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3938830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3939428Z ^ 2025-05-07T20:06:30.3939563Z 2025-05-07T20:06:30.3939794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.3940147Z 2025-05-07T20:06:30.3941015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.3942363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.3942955Z ^ 2025-05-07T20:06:30.3943155Z 2025-05-07T20:06:34.7708543Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:06:34.7728116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7730305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7731349Z ^ 2025-05-07T20:06:34.7731684Z 2025-05-07T20:06:34.7732090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:34.7732907Z 2025-05-07T20:06:34.7734382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7736917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7738070Z ^ 2025-05-07T20:06:34.7738393Z 2025-05-07T20:06:34.7740012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7742535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7743573Z ^ 2025-05-07T20:06:34.7743797Z 2025-05-07T20:06:34.7744318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:34.7744920Z 2025-05-07T20:06:34.7746380Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7748789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7749831Z ^ 2025-05-07T20:06:34.7750173Z 2025-05-07T20:06:34.7751575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7753947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7754926Z ^ 2025-05-07T20:06:34.7755142Z 2025-05-07T20:06:34.7755522Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:34.7756116Z 2025-05-07T20:06:34.7757447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7759614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7760612Z ^ 2025-05-07T20:06:34.7760928Z 2025-05-07T20:06:34.7762258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7764457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7765353Z ^ 
2025-05-07T20:06:34.7765581Z 2025-05-07T20:06:34.7765910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:34.7766623Z 2025-05-07T20:06:34.7768206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7770739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7771771Z ^ 2025-05-07T20:06:34.7772291Z 2025-05-07T20:06:34.7773743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7776036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7777199Z ^ 2025-05-07T20:06:34.7777449Z 2025-05-07T20:06:34.7778020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:34.7778646Z 2025-05-07T20:06:34.7780144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:34.7782441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:34.7783782Z ^ 2025-05-07T20:06:34.7784151Z 2025-05-07T20:06:37.9661892Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:06:37.9677015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9678789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9679758Z ^ 2025-05-07T20:06:37.9679934Z 2025-05-07T20:06:37.9680245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:37.9680703Z 2025-05-07T20:06:37.9681787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9683673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9684460Z ^ 2025-05-07T20:06:37.9684719Z 2025-05-07T20:06:37.9685788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9687647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9688515Z ^ 2025-05-07T20:06:37.9688734Z 2025-05-07T20:06:37.9689037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:37.9689698Z 2025-05-07T20:06:37.9691099Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9693167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9694116Z ^ 2025-05-07T20:06:37.9694552Z 2025-05-07T20:06:37.9696609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9698795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9699676Z ^ 2025-05-07T20:06:37.9699862Z 2025-05-07T20:06:37.9700177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:37.9700655Z 2025-05-07T20:06:37.9702071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9703945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9704755Z ^ 2025-05-07T20:06:37.9705054Z 2025-05-07T20:06:37.9706242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9708116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9709073Z ^ 
2025-05-07T20:06:37.9709297Z 2025-05-07T20:06:37.9709661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:37.9710109Z 2025-05-07T20:06:37.9711219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9713021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9713966Z ^ 2025-05-07T20:06:37.9714227Z 2025-05-07T20:06:37.9715353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9717288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9718208Z ^ 2025-05-07T20:06:37.9718405Z 2025-05-07T20:06:37.9718728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:37.9719223Z 2025-05-07T20:06:37.9720384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:37.9722340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:37.9723199Z ^ 2025-05-07T20:06:37.9723469Z 2025-05-07T20:06:44.9597018Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:06:44.9617645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9620413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9621490Z ^ 2025-05-07T20:06:44.9621805Z 2025-05-07T20:06:44.9622195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.9622795Z 2025-05-07T20:06:44.9624294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9626791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9627846Z ^ 2025-05-07T20:06:44.9628194Z 2025-05-07T20:06:44.9629881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9632037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9633032Z ^ 2025-05-07T20:06:44.9633273Z 2025-05-07T20:06:44.9633702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.9634283Z 2025-05-07T20:06:44.9635717Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9638012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9639035Z ^ 2025-05-07T20:06:44.9639352Z 2025-05-07T20:06:44.9640706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9642999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9644051Z ^ 2025-05-07T20:06:44.9644303Z 2025-05-07T20:06:44.9644686Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.9645268Z 2025-05-07T20:06:44.9646696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9649029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9650149Z ^ 2025-05-07T20:06:44.9650489Z 2025-05-07T20:06:44.9651945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9654291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9655284Z ^ 
2025-05-07T20:06:44.9655499Z 2025-05-07T20:06:44.9655884Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.9656578Z 2025-05-07T20:06:44.9658010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9660520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9661553Z ^ 2025-05-07T20:06:44.9661901Z 2025-05-07T20:06:44.9663323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9665773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9666762Z ^ 2025-05-07T20:06:44.9667001Z 2025-05-07T20:06:44.9667380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.9667945Z 2025-05-07T20:06:44.9669484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.9672017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.9673064Z ^ 2025-05-07T20:06:44.9673399Z 2025-05-07T20:06:47.4327640Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:06:47.4346780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4349453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4350510Z ^ 2025-05-07T20:06:47.4350726Z 2025-05-07T20:06:47.4351099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.4351688Z 2025-05-07T20:06:47.4353313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4355671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4356763Z ^ 2025-05-07T20:06:47.4357077Z 2025-05-07T20:06:47.4358657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4361081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4362108Z ^ 2025-05-07T20:06:47.4362333Z 2025-05-07T20:06:47.4362757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.4363321Z 2025-05-07T20:06:47.4364779Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4367123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4368184Z ^ 2025-05-07T20:06:47.4368512Z 2025-05-07T20:06:47.4369937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4372503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4373586Z ^ 2025-05-07T20:06:47.4373817Z 2025-05-07T20:06:47.4374222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.4374822Z 2025-05-07T20:06:47.4376344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4378749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4379812Z ^ 2025-05-07T20:06:47.4380149Z 2025-05-07T20:06:47.4381632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4384240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4385302Z ^ 
2025-05-07T20:06:47.4385516Z 2025-05-07T20:06:47.4385891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.4386464Z 2025-05-07T20:06:47.4387948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4390531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4391620Z ^ 2025-05-07T20:06:47.4391986Z 2025-05-07T20:06:47.4393601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4395875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4396878Z ^ 2025-05-07T20:06:47.4397124Z 2025-05-07T20:06:47.4397514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.4398118Z 2025-05-07T20:06:47.4399679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.4401884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.4402867Z ^ 2025-05-07T20:06:47.4403181Z 2025-05-07T20:06:48.2653196Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:06:48.2672220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2674780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2675765Z ^ 2025-05-07T20:06:48.2676011Z 2025-05-07T20:06:48.2676458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.2677021Z 2025-05-07T20:06:48.2678625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2680969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2682010Z ^ 2025-05-07T20:06:48.2682479Z 2025-05-07T20:06:48.2683931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2686279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2687300Z ^ 2025-05-07T20:06:48.2687537Z 2025-05-07T20:06:48.2687932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.2688553Z 2025-05-07T20:06:48.2689972Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2692268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2693286Z ^ 2025-05-07T20:06:48.2693611Z 2025-05-07T20:06:48.2695006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2697500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2698567Z ^ 2025-05-07T20:06:48.2698809Z 2025-05-07T20:06:48.2699217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.2699829Z 2025-05-07T20:06:48.2701245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2703473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2704537Z ^ 2025-05-07T20:06:48.2704863Z 2025-05-07T20:06:48.2706428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2708761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2709812Z ^ 
2025-05-07T20:06:48.2710049Z 2025-05-07T20:06:48.2710438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.2711159Z 2025-05-07T20:06:48.2712577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2714966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2716032Z ^ 2025-05-07T20:06:48.2716369Z 2025-05-07T20:06:48.2717995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2720323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2721314Z ^ 2025-05-07T20:06:48.2721552Z 2025-05-07T20:06:48.2722077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.2722676Z 2025-05-07T20:06:48.2724050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.2726361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.2727401Z ^ 2025-05-07T20:06:48.2727732Z 2025-05-07T20:06:48.8426042Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:06:48.8447932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8450612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8451764Z ^ 2025-05-07T20:06:48.8452027Z 2025-05-07T20:06:48.8452563Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8453227Z 2025-05-07T20:06:48.8454911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8457690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8458874Z ^ 2025-05-07T20:06:48.8459232Z 2025-05-07T20:06:48.8460885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8463538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8464707Z ^ 2025-05-07T20:06:48.8464949Z 2025-05-07T20:06:48.8465386Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8466058Z 2025-05-07T20:06:48.8467714Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8470581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8471743Z ^ 2025-05-07T20:06:48.8472119Z 2025-05-07T20:06:48.8473736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8476347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8477479Z ^ 2025-05-07T20:06:48.8477734Z 2025-05-07T20:06:48.8478167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8478817Z 2025-05-07T20:06:48.8480468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8483093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8484348Z ^ 2025-05-07T20:06:48.8484703Z 2025-05-07T20:06:48.8486325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8488925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8490061Z ^ 
2025-05-07T20:06:48.8490302Z 2025-05-07T20:06:48.8490821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8491487Z 2025-05-07T20:06:48.8493123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8495763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8497050Z ^ 2025-05-07T20:06:48.8497421Z 2025-05-07T20:06:48.8499029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8501632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8502833Z ^ 2025-05-07T20:06:48.8503076Z 2025-05-07T20:06:48.8503521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8504172Z 2025-05-07T20:06:48.8505813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8508473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8509635Z ^ 2025-05-07T20:06:48.8509989Z 2025-05-07T20:06:49.0716827Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:06:49.0727895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0729249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0729904Z ^ 2025-05-07T20:06:49.0730060Z 2025-05-07T20:06:49.0730298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.0730639Z 2025-05-07T20:06:49.0731481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0732878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0733492Z ^ 2025-05-07T20:06:49.0733684Z 2025-05-07T20:06:49.0734513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0735834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0736432Z ^ 2025-05-07T20:06:49.0736658Z 2025-05-07T20:06:49.0736894Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.0737247Z 2025-05-07T20:06:49.0738086Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0739433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0740034Z ^ 2025-05-07T20:06:49.0740240Z 2025-05-07T20:06:49.0741060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0742389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0742972Z ^ 2025-05-07T20:06:49.0743119Z 2025-05-07T20:06:49.0743350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.0743690Z 2025-05-07T20:06:49.0744516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0745852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0746511Z ^ 2025-05-07T20:06:49.0746703Z 2025-05-07T20:06:49.0747517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0748851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0749488Z ^ 
2025-05-07T20:06:49.0749626Z 2025-05-07T20:06:49.0749855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.0750208Z 2025-05-07T20:06:49.0751028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0752398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0752998Z ^ 2025-05-07T20:06:49.0753189Z 2025-05-07T20:06:49.0754016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0755384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0755981Z ^ 2025-05-07T20:06:49.0756114Z 2025-05-07T20:06:49.0756356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:49.0756694Z 2025-05-07T20:06:49.0757523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:49.0758860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:49.0759474Z ^ 2025-05-07T20:06:49.0759662Z 2025-05-07T20:06:56.5753252Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:06:56.5764423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5765819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5766428Z ^ 2025-05-07T20:06:56.5766567Z 2025-05-07T20:06:56.5766800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.5767150Z 2025-05-07T20:06:56.5768038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5769388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5769990Z ^ 2025-05-07T20:06:56.5770380Z 2025-05-07T20:06:56.5771206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5772532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5773159Z ^ 2025-05-07T20:06:56.5773295Z 2025-05-07T20:06:56.5773526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.5773884Z 2025-05-07T20:06:56.5774715Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5776079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5776786Z ^ 2025-05-07T20:06:56.5776997Z 2025-05-07T20:06:56.5777814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5779148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5779744Z ^ 2025-05-07T20:06:56.5779899Z 2025-05-07T20:06:56.5780131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.5780470Z 2025-05-07T20:06:56.5781309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5782706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5783315Z ^ 2025-05-07T20:06:56.5783508Z 2025-05-07T20:06:56.5784348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5785739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5786347Z ^ 
2025-05-07T20:06:56.5786483Z 2025-05-07T20:06:56.5786714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.5787068Z 2025-05-07T20:06:56.5787933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5789279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5789879Z ^ 2025-05-07T20:06:56.5790087Z 2025-05-07T20:06:56.5790944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5792280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5792866Z ^ 2025-05-07T20:06:56.5793017Z 2025-05-07T20:06:56.5793248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.5793585Z 2025-05-07T20:06:56.5794412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.5795750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.5796356Z ^ 2025-05-07T20:06:56.5796548Z 2025-05-07T20:06:56.9594859Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:06:56.9607681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9609067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9609707Z ^ 2025-05-07T20:06:56.9609854Z 2025-05-07T20:06:56.9610100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.9610474Z 2025-05-07T20:06:56.9611381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9612742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9613373Z ^ 2025-05-07T20:06:56.9613597Z 2025-05-07T20:06:56.9614430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9615786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9616471Z ^ 2025-05-07T20:06:56.9616675Z 2025-05-07T20:06:56.9616918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.9617263Z 2025-05-07T20:06:56.9618116Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9619461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9620097Z ^ 2025-05-07T20:06:56.9620294Z 2025-05-07T20:06:56.9621140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9622476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9623107Z ^ 2025-05-07T20:06:56.9623256Z 2025-05-07T20:06:56.9623496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.9623865Z 2025-05-07T20:06:56.9624706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9626125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9626744Z ^ 2025-05-07T20:06:56.9626968Z 2025-05-07T20:06:56.9627798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9629212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9629821Z ^ 
2025-05-07T20:06:56.9629987Z 2025-05-07T20:06:56.9630226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.9630577Z 2025-05-07T20:06:56.9631470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9632810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9633447Z ^ 2025-05-07T20:06:56.9633646Z 2025-05-07T20:06:56.9634504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9635869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9636503Z ^ 2025-05-07T20:06:56.9636647Z 2025-05-07T20:06:56.9636890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:56.9637263Z 2025-05-07T20:06:56.9638094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:56.9639459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:56.9640073Z ^ 2025-05-07T20:06:56.9640293Z 2025-05-07T20:06:58.5809965Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:06:58.5821436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5822813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5823449Z ^ 2025-05-07T20:06:58.5823601Z 2025-05-07T20:06:58.5823911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.5824285Z 2025-05-07T20:06:58.5825124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5826496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5827123Z ^ 2025-05-07T20:06:58.5827350Z 2025-05-07T20:06:58.5828213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5829573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5830183Z ^ 2025-05-07T20:06:58.5830355Z 2025-05-07T20:06:58.5830600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.5830946Z 2025-05-07T20:06:58.5831808Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5833151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5833787Z ^ 2025-05-07T20:06:58.5833983Z 2025-05-07T20:06:58.5834813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5836165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5845440Z ^ 2025-05-07T20:06:58.5845738Z 2025-05-07T20:06:58.5846002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.5847721Z 2025-05-07T20:06:58.5848590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5849991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5850617Z ^ 2025-05-07T20:06:58.5850845Z 2025-05-07T20:06:58.5851727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5853093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5853706Z ^ 
2025-05-07T20:06:58.5853856Z 2025-05-07T20:06:58.5854130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.5854521Z 2025-05-07T20:06:58.5855366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5856839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5857538Z ^ 2025-05-07T20:06:58.5857742Z 2025-05-07T20:06:58.5858573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5859938Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5860572Z ^ 2025-05-07T20:06:58.5860717Z 2025-05-07T20:06:58.5860951Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.5861322Z 2025-05-07T20:06:58.5862156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.5863531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.5864155Z ^ 2025-05-07T20:06:58.5864353Z 2025-05-07T20:06:59.5775095Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d 
-x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:06:59.5786065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5787451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5788151Z ^ 2025-05-07T20:06:59.5788303Z 2025-05-07T20:06:59.5788685Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.5789029Z 2025-05-07T20:06:59.5789851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5791200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5791826Z ^ 2025-05-07T20:06:59.5792021Z 2025-05-07T20:06:59.5792824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5794150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5794763Z ^ 2025-05-07T20:06:59.5794904Z 2025-05-07T20:06:59.5795139Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.5795504Z 2025-05-07T20:06:59.5796320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5797689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5798291Z ^ 2025-05-07T20:06:59.5798517Z 2025-05-07T20:06:59.5799325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5800650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5801246Z ^ 2025-05-07T20:06:59.5801444Z 2025-05-07T20:06:59.5801679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.5802020Z 2025-05-07T20:06:59.5802841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5804180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5804809Z ^ 2025-05-07T20:06:59.5805037Z 2025-05-07T20:06:59.5805846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5807170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5807788Z ^ 2025-05-07T20:06:59.5807931Z 2025-05-07T20:06:59.5808194Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:06:59.5808560Z 2025-05-07T20:06:59.5809377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5810741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5811353Z ^ 2025-05-07T20:06:59.5811548Z 2025-05-07T20:06:59.5812370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5813674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5814289Z ^ 2025-05-07T20:06:59.5814431Z 2025-05-07T20:06:59.5814688Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.5815025Z 2025-05-07T20:06:59.5815837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.5817413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.5818056Z ^ 2025-05-07T20:06:59.5818257Z 2025-05-07T20:07:00.0437300Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:00.0448023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0449442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0450062Z ^ 2025-05-07T20:07:00.0450203Z 2025-05-07T20:07:00.0450440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.0450828Z 2025-05-07T20:07:00.0451656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0453010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0453614Z ^ 2025-05-07T20:07:00.0453820Z 2025-05-07T20:07:00.0454640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0455975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0456647Z ^ 2025-05-07T20:07:00.0456802Z 2025-05-07T20:07:00.0457035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.0457380Z 2025-05-07T20:07:00.0458223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:07:00.0459553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0460165Z ^ 2025-05-07T20:07:00.0460357Z 2025-05-07T20:07:00.0461176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0462506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0463155Z ^ 2025-05-07T20:07:00.0463292Z 2025-05-07T20:07:00.0463523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.0463875Z 2025-05-07T20:07:00.0464700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0466040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0466669Z ^ 2025-05-07T20:07:00.0466875Z 2025-05-07T20:07:00.0467690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0469030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0469652Z ^ 2025-05-07T20:07:00.0469790Z 2025-05-07T20:07:00.0470037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.0470560Z 2025-05-07T20:07:00.0471400Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0472828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0473473Z ^ 2025-05-07T20:07:00.0473679Z 2025-05-07T20:07:00.0474507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0475877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0476519Z ^ 2025-05-07T20:07:00.0476665Z 2025-05-07T20:07:00.0476910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.0477281Z 2025-05-07T20:07:00.0478119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.0479483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.0480088Z ^ 2025-05-07T20:07:00.0480279Z 2025-05-07T20:07:00.1864457Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:00.9609793Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:00.9621330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9622804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9623450Z ^ 2025-05-07T20:07:00.9623596Z 2025-05-07T20:07:00.9623836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.9624203Z 2025-05-07T20:07:00.9625044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9626515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9627140Z ^ 2025-05-07T20:07:00.9627365Z 2025-05-07T20:07:00.9628260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9629706Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9630291Z ^ 2025-05-07T20:07:00.9630458Z 2025-05-07T20:07:00.9630727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.9631065Z 2025-05-07T20:07:00.9631903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9633239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9633866Z ^ 2025-05-07T20:07:00.9634061Z 2025-05-07T20:07:00.9634883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9636183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9636800Z ^ 2025-05-07T20:07:00.9636940Z 2025-05-07T20:07:00.9637174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.9637541Z 2025-05-07T20:07:00.9638367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9639702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9640298Z ^ 2025-05-07T20:07:00.9640512Z 2025-05-07T20:07:00.9641313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9642636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9643222Z ^ 2025-05-07T20:07:00.9643389Z 2025-05-07T20:07:00.9643620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.9643991Z 2025-05-07T20:07:00.9644821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9646130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9646754Z ^ 2025-05-07T20:07:00.9646948Z 2025-05-07T20:07:00.9647754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9649121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9649730Z ^ 2025-05-07T20:07:00.9649870Z 2025-05-07T20:07:00.9650102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.9650459Z 2025-05-07T20:07:00.9651303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.9652630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.9653227Z ^ 
2025-05-07T20:07:00.9653443Z 2025-05-07T20:07:02.5830161Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:02.5840955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5842308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5842905Z ^ 2025-05-07T20:07:02.5843070Z 2025-05-07T20:07:02.5843339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.5843766Z 2025-05-07T20:07:02.5844611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5845925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5846559Z ^ 2025-05-07T20:07:02.5846756Z 2025-05-07T20:07:02.5847642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5848949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:02.5849568Z ^ 2025-05-07T20:07:02.5849710Z 2025-05-07T20:07:02.5850002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.5850341Z 2025-05-07T20:07:02.5851156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5852490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5853091Z ^ 2025-05-07T20:07:02.5853315Z 2025-05-07T20:07:02.5854285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5855648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5856253Z ^ 2025-05-07T20:07:02.5856497Z 2025-05-07T20:07:02.5856741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.5857085Z 2025-05-07T20:07:02.5857946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5859297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5859930Z ^ 2025-05-07T20:07:02.5860126Z 2025-05-07T20:07:02.5860984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:07:02.5862314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5862934Z ^ 2025-05-07T20:07:02.5863076Z 2025-05-07T20:07:02.5863358Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.5863728Z 2025-05-07T20:07:02.5864564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5865933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5866545Z ^ 2025-05-07T20:07:02.5866803Z 2025-05-07T20:07:02.5867632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5868978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5869580Z ^ 2025-05-07T20:07:02.5869741Z 2025-05-07T20:07:02.5870009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.5870513Z 2025-05-07T20:07:02.5871368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.5872726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.5873435Z ^ 2025-05-07T20:07:02.5873636Z 2025-05-07T20:07:03.2142422Z [592/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:03.2153268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2154580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2155175Z ^ 2025-05-07T20:07:03.2155398Z 2025-05-07T20:07:03.2155630Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2155981Z 2025-05-07T20:07:03.2156795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2158104Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2158757Z ^ 2025-05-07T20:07:03.2158947Z 2025-05-07T20:07:03.2159761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2161161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2161738Z ^ 2025-05-07T20:07:03.2161885Z 2025-05-07T20:07:03.2162109Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2162440Z 2025-05-07T20:07:03.2163245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2164561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2165156Z ^ 2025-05-07T20:07:03.2165342Z 2025-05-07T20:07:03.2166136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2167430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2168014Z ^ 2025-05-07T20:07:03.2168147Z 2025-05-07T20:07:03.2168370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2168715Z 2025-05-07T20:07:03.2169526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2171154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2171754Z ^ 2025-05-07T20:07:03.2171959Z 2025-05-07T20:07:03.2172780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2174100Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2174766Z ^ 2025-05-07T20:07:03.2174902Z 2025-05-07T20:07:03.2175142Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2175480Z 2025-05-07T20:07:03.2176313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2177725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2178383Z ^ 2025-05-07T20:07:03.2178573Z 2025-05-07T20:07:03.2179388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2180718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2181315Z ^ 2025-05-07T20:07:03.2181490Z 2025-05-07T20:07:03.2181721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2182060Z 2025-05-07T20:07:03.2182900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2184260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2184871Z ^ 2025-05-07T20:07:03.2185058Z 2025-05-07T20:07:03.6783902Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:03.6795020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6796363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6797048Z ^ 2025-05-07T20:07:03.6797219Z 2025-05-07T20:07:03.6797459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.6797805Z 2025-05-07T20:07:03.6798657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6800041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6800671Z ^ 2025-05-07T20:07:03.6800869Z 2025-05-07T20:07:03.6801700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6803068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6803684Z ^ 2025-05-07T20:07:03.6803830Z 2025-05-07T20:07:03.6804065Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.6804436Z 
2025-05-07T20:07:03.6805254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6806589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6807193Z ^ 2025-05-07T20:07:03.6807408Z 2025-05-07T20:07:03.6808214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6809537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6810126Z ^ 2025-05-07T20:07:03.6810288Z 2025-05-07T20:07:03.6810521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.6810859Z 2025-05-07T20:07:03.6811670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6812997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6813623Z ^ 2025-05-07T20:07:03.6813818Z 2025-05-07T20:07:03.6814621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6815940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:03.6816666Z ^ 2025-05-07T20:07:03.6816968Z 2025-05-07T20:07:03.6817213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.6817587Z 2025-05-07T20:07:03.6818484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6819896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6820510Z ^ 2025-05-07T20:07:03.6820706Z 2025-05-07T20:07:03.6821554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6822915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6823538Z ^ 2025-05-07T20:07:03.6823681Z 2025-05-07T20:07:03.6823937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.6824285Z 2025-05-07T20:07:03.6825152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.6826517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.6827154Z ^ 2025-05-07T20:07:03.6827351Z 2025-05-07T20:07:05.6596992Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:05.6608177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6609690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6610311Z ^ 2025-05-07T20:07:05.6610455Z 2025-05-07T20:07:05.6610719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.6611068Z 2025-05-07T20:07:05.6611964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6613300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6613923Z ^ 2025-05-07T20:07:05.6614119Z 2025-05-07T20:07:05.6615000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6616328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6617171Z ^ 2025-05-07T20:07:05.6617343Z 2025-05-07T20:07:05.6617584Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.6617934Z 2025-05-07T20:07:05.6618802Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6620151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6620804Z ^ 2025-05-07T20:07:05.6621006Z 2025-05-07T20:07:05.6621856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6623189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6623818Z ^ 2025-05-07T20:07:05.6623965Z 2025-05-07T20:07:05.6624205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.6624569Z 2025-05-07T20:07:05.6625409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6626734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6627360Z ^ 2025-05-07T20:07:05.6627561Z 2025-05-07T20:07:05.6628388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6629875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6630500Z ^ 
2025-05-07T20:07:05.6630645Z 2025-05-07T20:07:05.6630879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.6631241Z 2025-05-07T20:07:05.6632052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6633431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6634032Z ^ 2025-05-07T20:07:05.6634250Z 2025-05-07T20:07:05.6635109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6636427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6637013Z ^ 2025-05-07T20:07:05.6637153Z 2025-05-07T20:07:05.6637406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.6637751Z 2025-05-07T20:07:05.6638593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.6639928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.6640548Z ^ 2025-05-07T20:07:05.6640745Z 2025-05-07T20:07:06.3591062Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so 
fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:07:06.4236629Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:06.4237909Z ################################################################################ 2025-05-07T20:07:06.4238268Z [CMAKE] Running post-build script ... 2025-05-07T20:07:06.4238816Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:06.4239338Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:07:06.4239707Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:06.4240096Z ################################################################################ 2025-05-07T20:08:29.7994626Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:08:29.8006453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8007733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8008330Z ^ 2025-05-07T20:08:29.8008472Z 2025-05-07T20:08:29.8008708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:29.8009042Z 2025-05-07T20:08:29.8009838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8011085Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8011691Z ^ 2025-05-07T20:08:29.8011877Z 2025-05-07T20:08:29.8012665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8013939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8014525Z ^ 2025-05-07T20:08:29.8014659Z 2025-05-07T20:08:29.8014902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:29.8015225Z 2025-05-07T20:08:29.8016003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8017581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8018199Z ^ 2025-05-07T20:08:29.8018424Z 2025-05-07T20:08:29.8019254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8020650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8021253Z ^ 2025-05-07T20:08:29.8021419Z 2025-05-07T20:08:29.8021658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:29.8022006Z 2025-05-07T20:08:29.8023014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8024258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8024850Z ^ 2025-05-07T20:08:29.8025037Z 2025-05-07T20:08:29.8025829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8027065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8027650Z ^ 2025-05-07T20:08:29.8027782Z 2025-05-07T20:08:29.8028003Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:29.8028349Z 2025-05-07T20:08:29.8029121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8030383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8030956Z ^ 2025-05-07T20:08:29.8031161Z 2025-05-07T20:08:29.8031920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8033170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8033733Z ^ 2025-05-07T20:08:29.8033885Z 2025-05-07T20:08:29.8034105Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:08:29.8034417Z 2025-05-07T20:08:29.8035216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:29.8036519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:29.8037089Z ^ 2025-05-07T20:08:29.8037264Z 2025-05-07T20:08:36.7850169Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:08:36.7862042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7863396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7864196Z ^ 2025-05-07T20:08:36.7864354Z 2025-05-07T20:08:36.7864593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:36.7864937Z 2025-05-07T20:08:36.7865789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7867126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7867826Z ^ 2025-05-07T20:08:36.7868024Z 2025-05-07T20:08:36.7868868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7870390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7871009Z ^ 2025-05-07T20:08:36.7872137Z 2025-05-07T20:08:36.7872402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:36.7872750Z 2025-05-07T20:08:36.7873581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7874939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7875599Z ^ 2025-05-07T20:08:36.7875815Z 2025-05-07T20:08:36.7876639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7878018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7878624Z ^ 2025-05-07T20:08:36.7878780Z 2025-05-07T20:08:36.7879018Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:08:36.7879359Z 2025-05-07T20:08:36.7880210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7881552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7882294Z ^ 2025-05-07T20:08:36.7882483Z 2025-05-07T20:08:36.7883304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7884597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7885194Z ^ 2025-05-07T20:08:36.7885331Z 2025-05-07T20:08:36.7885559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:36.7885908Z 2025-05-07T20:08:36.7886719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7888033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7888622Z ^ 2025-05-07T20:08:36.7888827Z 2025-05-07T20:08:36.7889629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7890933Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7891561Z ^ 2025-05-07T20:08:36.7891713Z 2025-05-07T20:08:36.7891940Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:36.7892273Z 2025-05-07T20:08:36.7893165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:36.7894410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:36.7895023Z ^ 2025-05-07T20:08:36.7895203Z 2025-05-07T20:08:39.4101724Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:08:39.4112842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4114151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4114851Z ^ 2025-05-07T20:08:39.4114995Z 2025-05-07T20:08:39.4115230Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:08:39.4115588Z 2025-05-07T20:08:39.4116364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4117720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4118311Z ^ 2025-05-07T20:08:39.4118504Z 2025-05-07T20:08:39.4119306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4120626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4121222Z ^ 2025-05-07T20:08:39.4121363Z 2025-05-07T20:08:39.4121620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:39.4121948Z 2025-05-07T20:08:39.4122760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4124042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4124648Z ^ 2025-05-07T20:08:39.4124842Z 2025-05-07T20:08:39.4125638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4126910Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4127482Z ^ 2025-05-07T20:08:39.4127648Z 2025-05-07T20:08:39.4127877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:39.4128204Z 2025-05-07T20:08:39.4129008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4130403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4131009Z ^ 2025-05-07T20:08:39.4131198Z 2025-05-07T20:08:39.4131988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4133239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4133839Z ^ 2025-05-07T20:08:39.4133978Z 2025-05-07T20:08:39.4134231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:39.4134556Z 2025-05-07T20:08:39.4135335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4136864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4137497Z ^ 2025-05-07T20:08:39.4137735Z 2025-05-07T20:08:39.4138569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4140049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4140665Z ^ 2025-05-07T20:08:39.4140843Z 2025-05-07T20:08:39.4141085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:39.4141437Z 2025-05-07T20:08:39.4142309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:39.4143700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:39.4144356Z ^ 2025-05-07T20:08:39.4144564Z 2025-05-07T20:08:41.0252744Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win 
-L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:08:41.6532710Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:08:41.6814784Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:08:41.7099936Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:08:41.7101421Z ################################################################################ 2025-05-07T20:08:41.7101783Z [CMAKE] Running post-build script ... 2025-05-07T20:08:41.7102406Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:08:41.7103024Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:08:41.7103477Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:41.7103874Z ################################################################################ 2025-05-07T20:08:41.8122473Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:08:41.8124002Z ################################################################################ 2025-05-07T20:08:41.8124429Z [CMAKE] Running post-build script ... 2025-05-07T20:08:41.8125048Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:08:41.8125668Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:41.8126023Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:41.8126529Z ################################################################################ 2025-05-07T20:08:41.8554405Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:08:41.8555716Z ################################################################################ 2025-05-07T20:08:41.8556072Z [CMAKE] Running post-build script ... 2025-05-07T20:08:41.8556672Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:08:41.8557275Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:08:41.8557626Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:41.8558031Z ################################################################################ 2025-05-07T20:08:41.9451065Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:08:42.2398333Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:08:42.2399665Z ################################################################################ 2025-05-07T20:08:42.2400039Z [CMAKE] Running post-build script ... 2025-05-07T20:08:42.2400648Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:08:42.2401282Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:42.2401663Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:42.2402073Z ################################################################################ 2025-05-07T20:08:42.2403047Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:08:42.2443108Z -- Install configuration: "Release" 2025-05-07T20:08:42.2444252Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:08:42.2464284Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:08:42.2465400Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:08:42.2491700Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:08:42.2492854Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:08:42.2517300Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:08:42.2543392Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:08:42.2544383Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:08:42.2545682Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:08:42.2569082Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:08:42.2570298Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:08:42.2571472Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:08:48.4846530Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:08:49.6140855Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:08:52.2292676Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:08:52.6934910Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:08:52.6935980Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:08:52.6937243Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:08:52.6938453Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:08:52.6939659Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:08:52.6940840Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:08:52.6941984Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:08:52.6943156Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:08:52.6944398Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:08:52.6945667Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:08:52.6946899Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:08:52.6948155Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 
2025-05-07T20:08:52.6949458Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:08:52.6950873Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:08:52.6952030Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:08:52.6953192Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:08:52.6954554Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:08:52.6955795Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:08:52.6956891Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:08:52.6979536Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:08:52.7031136Z 2025-05-07T20:08:52.7064344Z 2025-05-07T20:08:52.7064864Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:08:52.7065762Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:08:52.7066819Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:08:52.7067515Z copying fbgemm_gpu/metrics.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:08:52.7068442Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:08:52.7069633Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:08:52.7070862Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:08:52.7071693Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:08:52.7072532Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:08:52.7073334Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:08:52.7074881Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:08:52.7076302Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:08:52.7077493Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:08:52.7078494Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:08:52.7079513Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:08:52.7080713Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:08:52.7081993Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:08:52.7083262Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:08:52.7084840Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:08:52.7086132Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:08:52.7087173Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:08:52.7088025Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:08:52.7088662Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config 2025-05-07T20:08:52.7089360Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:08:52.7090238Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:08:52.7091026Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs 2025-05-07T20:08:52.7091722Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:08:52.7092508Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:08:52.7093298Z copying fbgemm_gpu/docs/examples.py 
-> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:08:52.7094237Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:08:52.7095229Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:08:52.7096426Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:08:52.7097432Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:08:52.7098262Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:08:52.7099073Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:08:52.7099795Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:08:52.7100508Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:08:52.7101404Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:08:52.7102147Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll 2025-05-07T20:08:52.7102808Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:08:52.7103484Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:08:52.7104132Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:08:52.7104808Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton 
2025-05-07T20:08:52.7105495Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:08:52.7106316Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:08:52.7107148Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:08:52.7108000Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:08:52.7108791Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils 2025-05-07T20:08:52.7109462Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:08:52.7110285Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:08:52.7111115Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:08:52.7111971Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:08:52.7112728Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:08:52.7113425Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:08:52.7114262Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:08:52.7115025Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:08:52.7115734Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:08:52.7116587Z copying fbgemm_gpu/sll/meta/meta_sll.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:08:52.7117317Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7118103Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:08:52.7118987Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:08:52.7120070Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:08:52.7121350Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:08:52.7122494Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:08:52.7123594Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:08:52.7124873Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:08:52.7126293Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:08:52.7127690Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:08:52.7129031Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:08:52.7130424Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:08:52.7131678Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:08:52.7132937Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:08:52.7133971Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7134718Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:08:52.7135644Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:08:52.7136635Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:08:52.7137633Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:08:52.7138663Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:08:52.7139751Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:08:52.7140748Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:08:52.7141677Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:08:52.7142714Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:08:52.7143873Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:08:52.7144863Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:08:52.7145599Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:08:52.7146345Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:08:52.7147352Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:08:52.7148242Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7148943Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:08:52.7149769Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:08:52.7150639Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:08:52.7151499Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:08:52.7152252Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:08:52.7152982Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:08:52.7153964Z copying 
fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:08:52.7154982Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7155706Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:08:52.7156569Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:08:52.7157424Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:08:52.7158326Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:08:52.7159264Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:08:52.7160020Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:08:52.7160809Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:08:52.7161950Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:08:52.7162954Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:08:52.7163750Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:08:52.7164802Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:08:52.7165475Z 2025-05-07T20:08:52.7281519Z INFO:root:running bdist_wheel 
2025-05-07T20:08:52.7329645Z INFO:root:running build 2025-05-07T20:08:52.7330094Z INFO:root:running build_py 2025-05-07T20:08:52.7332253Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7333931Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7336525Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7337846Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7339218Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7340772Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7342192Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7343548Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7345101Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7346513Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7347889Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7349692Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7351122Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7352644Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7354077Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7355416Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7356842Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7358525Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7360415Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7363113Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7364782Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7366184Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7367502Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7369326Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:08:52.7370617Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:08:52.7372319Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:08:52.7374359Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7375383Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7376813Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7378385Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7379726Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7381107Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7382538Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7383924Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7385358Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7386669Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:08:52.7388441Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:08:52.7389501Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:08:52.7390955Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:08:52.7392585Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:08:52.7393904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:08:52.7395392Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:08:52.7396412Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:08:52.7398408Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:08:52.7399449Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:08:52.7400766Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:08:52.7402097Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:08:52.7403573Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:08:52.7405443Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:08:52.7406457Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:08:52.7407853Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:08:52.7409503Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:08:52.7410885Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:08:52.7412169Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:08:52.7413278Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:08:52.7414737Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:08:52.7417244Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:08:52.7418432Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:08:52.7419858Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:08:52.7422173Z INFO:root:creating 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7423308Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7424783Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7426446Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7428070Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7429679Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7431207Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7432835Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7434537Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7436222Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7437859Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7439534Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7441159Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7442758Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:08:52.7444029Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7445159Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7446610Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7448007Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7449437Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7451257Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7452767Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7454253Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7455701Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7457281Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7459009Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7460506Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:08:52.7461646Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:08:52.7462816Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:08:52.7464301Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:08:52.7465548Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7466661Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7468081Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7469488Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7471055Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:08:52.7472814Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:08:52.7473954Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:08:52.7475554Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:08:52.7477663Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 
2025-05-07T20:08:52.7478793Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7480336Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7481840Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7483278Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7484733Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:08:52.7485941Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:08:52.7487240Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:08:52.7488869Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:08:52.7490130Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:08:52.7491354Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:08:52.7492931Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:08:52.7533577Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7568715Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.7866681Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:52.9179281Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.3257445Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.3263855Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.4557497Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.4669168Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.4889129Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:56.5589444Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:59.2636287Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:08:59.3454777Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:06.6466923Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:07.8698933Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:10.4865638Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:10.9505773Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:10.9878303Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.2597686Z INFO:root:creating 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2603008Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2605008Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2609719Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2621099Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2627308Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2634157Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2640912Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2652175Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2658465Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2669696Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2676601Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2689287Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2698590Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2705927Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:11.2711891Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:11.2713434Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:11.2719437Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:11.2725539Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.2764609Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8480914Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8482345Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8483685Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8484934Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8495458Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8497172Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8498497Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8499749Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8501080Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8502294Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8503632Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8505000Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8506442Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8507770Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8509109Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8510509Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8511973Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8513435Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8514908Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8516372Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8517724Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8518932Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:11.8520162Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:11.8521488Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:11.8522804Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8524062Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8525324Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8526667Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8528046Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8529478Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8530844Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8532140Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8533442Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:11.8534742Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:11.8536106Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:11.8537469Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:09:11.8538704Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:09:11.8539959Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:11.8541259Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:11.8542564Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:11.8543904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:11.8545215Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:11.8546497Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:11.8547803Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:11.8549313Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:11.8551139Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:11.8552742Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:11.8554291Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:11.8555808Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:11.8557363Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8558960Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8560564Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8562121Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8563637Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8565124Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8566673Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8568313Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8569945Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8571732Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 
2025-05-07T20:09:11.8573351Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8574927Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8576553Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8578107Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8579692Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8581213Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8582900Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8584460Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8586017Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8589508Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8591336Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8592821Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8594301Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8595730Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8597079Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:11.8598493Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:11.8599892Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8601210Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8602532Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8604055Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8606640Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:11.8608217Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:11.8609670Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8611238Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8612647Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8614168Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8615561Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 
2025-05-07T20:09:11.8617233Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:11.8618947Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:11.8620506Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:11.8621993Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:11.8636312Z INFO:skbuild:copied 90 files 2025-05-07T20:09:11.8636667Z INFO:root:running build_ext 2025-05-07T20:09:11.8637181Z INFO:root:installing to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:11.8637676Z INFO:root:running install 2025-05-07T20:09:11.8694450Z INFO:root:running install_lib 2025-05-07T20:09:11.8695041Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:11.8695757Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:11.8696647Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:11.8697811Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:11.8699362Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:11.8700496Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:11.8701580Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8703015Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8704483Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8705988Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8707693Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8709289Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8710895Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8712390Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8713894Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:11.8715004Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:11.8716198Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:11.8717725Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:11.8718863Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:11.8719580Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:11.8720710Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:11.8722209Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:11.8723317Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:11.8724456Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:11.8725981Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:11.8727119Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8728280Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8729836Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8731480Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8733281Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8734972Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8736722Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8738507Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8740360Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8742187Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8744225Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8746037Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8747806Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8749538Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:11.8751165Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:11.8752225Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:11.8752950Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8754104Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8755661Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8757220Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8758767Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8760424Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8762073Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8763669Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8765284Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8766913Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8768579Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8770361Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:11.8771519Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:11.8772647Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:11.8774262Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:11.8775457Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8776228Z INFO:root:creating 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:11.8777487Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:11.8779177Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:11.8780834Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8782328Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8783830Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8785367Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:11.8786507Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:11.8787714Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:11.8789312Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:11.8790553Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8791678Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8793209Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8794795Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8796336Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8797935Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:11.8799423Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:11.8800488Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:11.8801256Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 
2025-05-07T20:09:11.8802453Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:11.8804109Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:11.8805724Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:11.8807208Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:11.8808692Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:11.8810215Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:11.8811344Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:11.8812421Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:11.8813934Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/filestore.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:11.8815421Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:11.8816956Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:11.8818428Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:11.8819792Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:11.8834739Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:11.8968883Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1643123Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1644701Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1745259Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1755262Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1775515Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.1834606Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.3946985Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.4015090Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:12.9579167Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.0450011Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.2459399Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.2818433Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.2849241Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3058760Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3060349Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3062655Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3064728Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3066846Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3068886Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3071114Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3073208Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3075370Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3077473Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3079605Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3081765Z 
INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3083904Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3085919Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3088014Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:13.3089502Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:13.3091113Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:13.3093236Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:13.3095052Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3096554Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3533895Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3535424Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3536984Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3538353Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3539806Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3541398Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3542922Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_comm.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3544332Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3545758Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3547393Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3548831Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3550445Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3552022Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3553589Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3555113Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3556773Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3558414Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3560052Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3561724Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3563392Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3564909Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3566419Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:13.3567230Z INFO:skbuild:copied 125 files 2025-05-07T20:09:13.3567505Z INFO:root:running install_egg_info 2025-05-07T20:09:13.3611438Z INFO:root:running egg_info 2025-05-07T20:09:13.3650803Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:13.3651869Z INFO:root:writing dependency_links to 
fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:13.3653628Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:13.3654595Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:13.3750777Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:13.3782477Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:13.3783579Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.13.egg-info 2025-05-07T20:09:13.3790071Z INFO:root:running install_scripts 2025-05-07T20:09:13.3790497Z INFO:skbuild:copied 0 files 2025-05-07T20:09:16.0892795Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:16.0894487Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-dn9uxucd/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:16.0896149Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:16.1154404Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:16.1168471Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:16.1169109Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:16.3182101Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:16.3317738Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:16.3449282Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:18.0559921Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:18.2576694Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:18.9666041Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 
2025-05-07T20:09:19.0739080Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:19.6657453Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:09:37.6884605Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:09:38.9492677Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:06.6415096Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:09.4326780Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:13.0197183Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:13.7077226Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:13.9256153Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:22.4719231Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:10:33.3456283Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:10:34.8109174Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:10:34.8464333Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:10:34.8466069Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:10:34.8467993Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:10:34.8471286Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:10:34.8474270Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:10:34.8477223Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:10:34.8487842Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:10:34.8491919Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:10:34.8494748Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:10:34.8496257Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 
2025-05-07T20:10:34.8497766Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:10:34.8499610Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:10:34.8502732Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:10:34.8525594Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:10:34.8567979Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:10:34.8571380Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:10:34.8572743Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:10:34.8574813Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:10:34.8575984Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:10:34.8578338Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:10:34.8579997Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:10:34.8581623Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:10:34.8582785Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:10:34.8584601Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:10:34.8587045Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:10:34.8588657Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:10:34.8590725Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:10:34.8592242Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:10:34.8597875Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:10:34.8599667Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:10:34.8601226Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:10:34.8603046Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:10:34.8604777Z INFO:wheel:adding 
'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:10:34.8606719Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:10:34.8612843Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:10:34.8615189Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:10:34.8617669Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:10:34.8620123Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:10:34.8621352Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:10:34.8623304Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:10:34.8625525Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:10:34.8628933Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:10:34.8632867Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:10:34.8634787Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:10:34.8637003Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:10:34.8642277Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:10:34.8647463Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:10:34.8649612Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:10:34.8653117Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:10:34.8658512Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:10:34.8661059Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:10:34.8663892Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:10:34.8667402Z INFO:wheel:adding 
'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:10:34.8669364Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:10:34.8671403Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:10:34.8674443Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:10:34.8677429Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:10:34.8680214Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:10:34.8683248Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:10:34.8686189Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:10:34.8689150Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:10:34.8692176Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:10:34.8695466Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:10:34.8698291Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:10:34.8700102Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:10:34.8702750Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:10:34.8703766Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:10:34.8705878Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:10:34.8707774Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:10:34.8712618Z INFO:wheel:adding 
'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:10:34.8714916Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:10:34.8717104Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:10:34.8718821Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:10:34.8720245Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:10:34.8723321Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:10:34.8725943Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:10:34.8728227Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:10:34.8729717Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:10:34.8731257Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:10:34.8732808Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:10:34.8734039Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:10:34.8735278Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:10:34.8741230Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:10:34.8765465Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:10:34.8769127Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:10:34.8772297Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:10:34.8773758Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:10:34.8776427Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:10:34.8778220Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:10:34.8779454Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:10:34.8781076Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:10:34.8783429Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:10:34.8788807Z INFO:wheel:adding 
'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:10:34.8790813Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:10:34.8792456Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:10:34.8799842Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:10:34.8804288Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:10:34.8806473Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:10:34.8814543Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:10:34.8816613Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:10:34.8818873Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:10:34.8820345Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:10:34.8850004Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:10:34.8853718Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:10:34.8854500Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:10:34.8855220Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:10:34.8858092Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:10:34.8860689Z INFO:root:removing _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:10:35.0376460Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:10:35.0377027Z │ │ Version │ 2025-05-07T20:10:35.0377806Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:10:35.0378365Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:10:35.0378927Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:35.0379503Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:10:35.0380093Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:35.0380719Z │ 
CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:10:35.0381295Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:10:35.0381793Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:10:35.0382314Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:10:35.0382873Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:10:35.0383443Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:10:35.2967515Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:35.3719355Z 2025-05-07T20:10:35.3872395Z ################################################################################ 2025-05-07T20:10:35.3873007Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.3873468Z [CHECK] Listing out library size: 2025-05-07T20:10:35.3873923Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.3874242Z 2025-05-07T20:10:35.3892190Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.3893393Z 2025-05-07T20:10:35.3894857Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.3897783Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.3899332Z 2025-05-07T20:10:35.3971647Z GLIBC_2.2.5 2025-05-07T20:10:35.3972190Z GLIBC_2.14 2025-05-07T20:10:35.3972462Z 2025-05-07T20:10:35.3972471Z 2025-05-07T20:10:35.3973098Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.3974016Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.3974602Z 2025-05-07T20:10:35.4053684Z GLIBCXX_3.4 
2025-05-07T20:10:35.4057681Z 2025-05-07T20:10:35.4057725Z 2025-05-07T20:10:35.4075109Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.EJhZfCCt5z.symbols.txt 2025-05-07T20:10:35.4075655Z 2025-05-07T20:10:35.4103778Z 2025-05-07T20:10:35.4142975Z [CHECK] Total Number of symbols: 841 2025-05-07T20:10:35.4161657Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:10:35.4180083Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.DPR5bPXwxD.usymbols.txt 2025-05-07T20:10:35.4181630Z 2025-05-07T20:10:35.4205248Z 2025-05-07T20:10:35.4236618Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:10:35.4257453Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.4258256Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.4258646Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.4259219Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.4259812Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:35.4260243Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.4260550Z U abort@GLIBC_2.2.5 2025-05-07T20:10:35.4260851Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:35.4261145Z U close@GLIBC_2.2.5 2025-05-07T20:10:35.4261423Z U fputs@GLIBC_2.2.5 2025-05-07T20:10:35.4261718Z U free@GLIBC_2.2.5 2025-05-07T20:10:35.4262000Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:10:35.4262308Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:35.4262692Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:35.4262984Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:10:35.4263276Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:35.4263675Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:35.4263934Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:35.4264297Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.4264577Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.4264842Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.4265116Z U mmap@GLIBC_2.2.5 2025-05-07T20:10:35.4265377Z U mprotect@GLIBC_2.2.5 2025-05-07T20:10:35.4265661Z U munmap@GLIBC_2.2.5 2025-05-07T20:10:35.4265920Z U open64@GLIBC_2.2.5 2025-05-07T20:10:35.4266277Z U operator 
delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.4266590Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:10:35.4266914Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:35.4267230Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:35.4267539Z U read@GLIBC_2.2.5 2025-05-07T20:10:35.4267849Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:35.4268133Z U shm_open@GLIBC_2.2.5 2025-05-07T20:10:35.4268414Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:10:35.4268688Z U snprintf@GLIBC_2.2.5 2025-05-07T20:10:35.4269000Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.4269280Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:35.4269552Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:35.4269821Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.4270079Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:35.4270703Z U syscall@GLIBC_2.2.5 2025-05-07T20:10:35.4271026Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:35.4271319Z U uname@GLIBC_2.2.5 2025-05-07T20:10:35.4271607Z U unlink@GLIBC_2.2.5 2025-05-07T20:10:35.4271904Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:10:35.4272254Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.4272688Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.4273128Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.4273503Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.4273837Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.4274143Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.4274448Z w __gmon_start__ 2025-05-07T20:10:35.4274770Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.4275245Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.4275508Z 2025-05-07T20:10:35.4312912Z linux-vdso.so.1 (0x00007fff465fc000) 2025-05-07T20:10:35.4314340Z libtorch.so => not found 2025-05-07T20:10:35.4315473Z libc10.so => not found 2025-05-07T20:10:35.4315911Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.4316217Z libc10_cuda.so => not found 2025-05-07T20:10:35.4316628Z libnccl.so.2 => not 
found 2025-05-07T20:10:35.4316935Z libcuda.so.1 => not found 2025-05-07T20:10:35.4317219Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.4317551Z libtorch_cpu.so => not found 2025-05-07T20:10:35.4317975Z libtorch_cuda.so => not found 2025-05-07T20:10:35.4318426Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4e1555c000) 2025-05-07T20:10:35.4318854Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f4e15506000) 2025-05-07T20:10:35.4319231Z librt.so.1 => /lib64/librt.so.1 (0x00007f4e154ff000) 2025-05-07T20:10:35.4319629Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4e154d1000) 2025-05-07T20:10:35.4320050Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4e154cc000) 2025-05-07T20:10:35.4320471Z libc.so.6 => /lib64/libc.so.6 (0x00007f4e152c4000) 2025-05-07T20:10:35.4320848Z libm.so.6 => /lib64/libm.so.6 (0x00007f4e151e9000) 2025-05-07T20:10:35.4321198Z /lib64/ld-linux-x86-64.so.2 (0x00007f4e1583d000) 2025-05-07T20:10:35.4321436Z 2025-05-07T20:10:35.4321577Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.4321943Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:10:35.4322236Z 2025-05-07T20:10:35.4354793Z 2025-05-07T20:10:35.4355938Z Dynamic section at offset 0x75898 contains 39 entries: 2025-05-07T20:10:35.4356844Z Tag Type Name/Value 2025-05-07T20:10:35.4357503Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.4358049Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.4358567Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.4359189Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.4359749Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.4360267Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.4360822Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.4361352Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cpu.so] 2025-05-07T20:10:35.4361906Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.4362430Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.4362975Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.4363501Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:10:35.4364005Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.4364551Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:35.4365068Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.4365585Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:10:35.4366009Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:10:35.4366380Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:10:35.4366749Z 0x0000000000000019 (INIT_ARRAY) 0x74ac0 2025-05-07T20:10:35.4367101Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.4367477Z 0x000000000000001a (FINI_ARRAY) 0x74ac8 2025-05-07T20:10:35.4367830Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.4368317Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.4368648Z 0x0000000000000005 (STRTAB) 0x6980 2025-05-07T20:10:35.4369001Z 0x0000000000000006 (SYMTAB) 0x1a90 2025-05-07T20:10:35.4369562Z 0x000000000000000a (STRSZ) 48829 (bytes) 2025-05-07T20:10:35.4370305Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.4370933Z 0x0000000000000003 (PLTGOT) 0x75fe8 2025-05-07T20:10:35.4371357Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:10:35.4371738Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.4372406Z 0x0000000000000017 (JMPREL) 0x162e0 2025-05-07T20:10:35.4372917Z 0x0000000000000007 (RELA) 0x12f98 2025-05-07T20:10:35.4373284Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:10:35.4373665Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.4374018Z 0x000000006ffffffe (VERNEED) 0x12ed8 
2025-05-07T20:10:35.4374345Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.4374682Z 0x000000006ffffff0 (VERSYM) 0x1283e 2025-05-07T20:10:35.4375022Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:10:35.4375326Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.4375555Z 2025-05-07T20:10:35.4375669Z ################################################################################ 2025-05-07T20:10:35.4375894Z 2025-05-07T20:10:35.4375898Z 2025-05-07T20:10:35.4376012Z ################################################################################ 2025-05-07T20:10:35.4376593Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4377067Z [CHECK] Listing out library size: 2025-05-07T20:10:35.4377554Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4377913Z 2025-05-07T20:10:35.4378092Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4378374Z 2025-05-07T20:10:35.4378734Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4379708Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.4380263Z 2025-05-07T20:10:35.4423641Z GLIBC_2.2.5 2025-05-07T20:10:35.4424690Z GLIBC_2.14 2025-05-07T20:10:35.4425416Z 2025-05-07T20:10:35.4425647Z 2025-05-07T20:10:35.4426483Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4427491Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.4428070Z 2025-05-07T20:10:35.4478719Z GLIBCXX_3.4 2025-05-07T20:10:35.4479158Z GLIBCXX_3.4.9 2025-05-07T20:10:35.4479482Z GLIBCXX_3.4.21 
2025-05-07T20:10:35.4480344Z 2025-05-07T20:10:35.4481056Z 2025-05-07T20:10:35.4503716Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.v4kV4lnQmv.symbols.txt 2025-05-07T20:10:35.4505147Z 2025-05-07T20:10:35.4523916Z 2025-05-07T20:10:35.4554533Z [CHECK] Total Number of symbols: 116 2025-05-07T20:10:35.4569721Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:35.4585251Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.oK0SilOOrs.usymbols.txt 2025-05-07T20:10:35.4585764Z 2025-05-07T20:10:35.4604284Z 2025-05-07T20:10:35.4633227Z [CHECK] Listing out undefined symbols (55 total): 2025-05-07T20:10:35.4655279Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.4655971Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.4656289Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.4656698Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.4657022Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.4657332Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.4657866Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.4658194Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.4658497Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:35.4658824Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.4659141Z U c10::BoolType::get() 2025-05-07T20:10:35.4659446Z U c10::StringType::get() 2025-05-07T20:10:35.4659817Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.4660567Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.4661767Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.4662533Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:35.4662830Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:35.4663109Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.4663396Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.4663674Z U memset@GLIBC_2.2.5 
2025-05-07T20:10:35.4663987Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.4664342Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.4664743Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:35.4665448Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:35.4666250Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.4667074Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.4667727Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.4668098Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.4668512Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.4668909Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.4669294Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.4669786Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.4670877Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.4671673Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.4672031Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.4672380Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.4672728Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.4673048Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.4673363Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.4673642Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:35.4673961Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.4674776Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.4675943Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, 
std::vector > const&) & 2025-05-07T20:10:35.4676929Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.4677634Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:10:35.4678031Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.4678462Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.4678885Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.4679524Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.4680187Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.4680625Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.4680954Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.4681258Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.4681561Z w __gmon_start__ 2025-05-07T20:10:35.4681848Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:10:35.4682232Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.4682681Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4683070Z 2025-05-07T20:10:35.4701394Z linux-vdso.so.1 (0x00007ffecb1ef000) 2025-05-07T20:10:35.4702452Z libtorch.so => not found 2025-05-07T20:10:35.4703148Z libc10.so => not found 2025-05-07T20:10:35.4703850Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.4704600Z libc10_cuda.so => not found 2025-05-07T20:10:35.4705475Z libnccl.so.2 => not found 2025-05-07T20:10:35.4706198Z libcuda.so.1 => not found 2025-05-07T20:10:35.4707010Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.4707296Z libtorch_cpu.so => not found 2025-05-07T20:10:35.4707559Z libtorch_cuda.so => not found 2025-05-07T20:10:35.4707901Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb74ea15000) 2025-05-07T20:10:35.4708310Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fb74e9bf000) 2025-05-07T20:10:35.4708759Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb74e98f000) 
2025-05-07T20:10:35.4709198Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fb74e98a000) 2025-05-07T20:10:35.4709599Z libc.so.6 => /lib64/libc.so.6 (0x00007fb74e782000) 2025-05-07T20:10:35.4709973Z libm.so.6 => /lib64/libm.so.6 (0x00007fb74e6a7000) 2025-05-07T20:10:35.4710330Z /lib64/ld-linux-x86-64.so.2 (0x00007fb74ec8a000) 2025-05-07T20:10:35.4710577Z 2025-05-07T20:10:35.4710685Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.4711090Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:35.4711421Z 2025-05-07T20:10:35.4738461Z 2025-05-07T20:10:35.4739552Z Dynamic section at offset 0x8c98 contains 38 entries: 2025-05-07T20:10:35.4740721Z Tag Type Name/Value 2025-05-07T20:10:35.4741994Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.4743445Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.4744903Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.4746360Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.4747815Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.4748882Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.4749362Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.4749851Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.4750317Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.4750793Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.4751249Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.4751810Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.4752439Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:35.4753095Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.4753601Z 
0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:10:35.4754064Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:10:35.4754396Z 0x000000000000000d (FINI) 0x6f80 2025-05-07T20:10:35.4754719Z 0x0000000000000019 (INIT_ARRAY) 0x9bb0 2025-05-07T20:10:35.4755070Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:10:35.4755406Z 0x000000000000001a (FINI_ARRAY) 0x9bc0 2025-05-07T20:10:35.4755747Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.4756095Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.4756408Z 0x0000000000000005 (STRTAB) 0xed0 2025-05-07T20:10:35.4756733Z 0x0000000000000006 (SYMTAB) 0x3d8 2025-05-07T20:10:35.4757066Z 0x000000000000000a (STRSZ) 7795 (bytes) 2025-05-07T20:10:35.4757426Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.4757757Z 0x0000000000000003 (PLTGOT) 0x9fe8 2025-05-07T20:10:35.4758112Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:10:35.4758447Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.4758770Z 0x0000000000000017 (JMPREL) 0x33a0 2025-05-07T20:10:35.4759128Z 0x0000000000000007 (RELA) 0x2ef0 2025-05-07T20:10:35.4759464Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:10:35.4759822Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.4760156Z 0x000000006ffffffe (VERNEED) 0x2e30 2025-05-07T20:10:35.4760489Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:35.4760808Z 0x000000006ffffff0 (VERSYM) 0x2d44 2025-05-07T20:10:35.4761144Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:10:35.4761476Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.4761688Z 2025-05-07T20:10:35.4761805Z ################################################################################ 2025-05-07T20:10:35.4762028Z 2025-05-07T20:10:35.4762032Z 2025-05-07T20:10:35.4762163Z ################################################################################ 2025-05-07T20:10:35.4762589Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 
2025-05-07T20:10:35.4763010Z [CHECK] Listing out library size: 2025-05-07T20:10:35.4763393Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.4763710Z 2025-05-07T20:10:35.4763856Z 6 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.4764090Z 2025-05-07T20:10:35.4764413Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.4765342Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.4765862Z 2025-05-07T20:10:35.5027947Z GLIBC_2.2.5 2025-05-07T20:10:35.5028731Z GLIBC_2.3 2025-05-07T20:10:35.5029287Z GLIBC_2.14 2025-05-07T20:10:35.5029611Z 2025-05-07T20:10:35.5029625Z 2025-05-07T20:10:35.5030625Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.5033225Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.5034825Z 2025-05-07T20:10:35.5299848Z GLIBCXX_3.4 2025-05-07T20:10:35.5300498Z GLIBCXX_3.4.9 2025-05-07T20:10:35.5301080Z GLIBCXX_3.4.11 2025-05-07T20:10:35.5301671Z GLIBCXX_3.4.14 2025-05-07T20:10:35.5302227Z GLIBCXX_3.4.15 2025-05-07T20:10:35.5302797Z GLIBCXX_3.4.18 2025-05-07T20:10:35.5303345Z GLIBCXX_3.4.21 2025-05-07T20:10:35.5303715Z 2025-05-07T20:10:35.5304010Z 2025-05-07T20:10:35.5322463Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.ejBdnlesjb.symbols.txt 2025-05-07T20:10:35.5323687Z 2025-05-07T20:10:35.5549524Z 2025-05-07T20:10:35.5576280Z [CHECK] Total Number of symbols: 4951 2025-05-07T20:10:35.5597354Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:10:35.5620297Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.Gf6XL2Uk1F.usymbols.txt 2025-05-07T20:10:35.5621897Z 
2025-05-07T20:10:35.5652948Z 2025-05-07T20:10:35.5681548Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:10:35.5700136Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.5701290Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:35.5702292Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.5703227Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.5704140Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.5705081Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:35.5706027Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.5706517Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.5706826Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.5707192Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:10:35.5707527Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.5707862Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:35.5708187Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:35.5708602Z U __extendhfsf2 2025-05-07T20:10:35.5708935Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.5709256Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:35.5709588Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:35.5709873Z U __truncsfhf2 2025-05-07T20:10:35.5710158Z U abort@GLIBC_2.2.5 2025-05-07T20:10:35.5710684Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:35.5711419Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:35.5712348Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:35.5713477Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:35.5714547Z U 
asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:10:35.5715273Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:35.5715854Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:35.5716457Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:10:35.5717058Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:10:35.5717574Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:10:35.5718110Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:10:35.5718816Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:10:35.5719398Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:10:35.5719821Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:10:35.5720483Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:10:35.5721028Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:10:35.5721471Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:10:35.5721954Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:10:35.5722322Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:35.5722632Z U ceilf@GLIBC_2.2.5 2025-05-07T20:10:35.5722920Z U cpuinfo_get_packages 2025-05-07T20:10:35.5723250Z U cpuinfo_get_packages_count 2025-05-07T20:10:35.5723578Z U cpuinfo_initialize 2025-05-07T20:10:35.5723862Z U cpuinfo_isa 2025-05-07T20:10:35.5724149Z U floor@GLIBC_2.2.5 2025-05-07T20:10:35.5724430Z U fma@GLIBC_2.2.5 2025-05-07T20:10:35.5724740Z U fmaf@GLIBC_2.2.5 2025-05-07T20:10:35.5725008Z U free@GLIBC_2.2.5 
2025-05-07T20:10:35.5725299Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:35.5725580Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:35.5725882Z U ldexp@GLIBC_2.2.5 2025-05-07T20:10:35.5726184Z U log2@GLIBC_2.2.5 2025-05-07T20:10:35.5726453Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:35.5726745Z U lrintf@GLIBC_2.2.5 2025-05-07T20:10:35.5727018Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.5727344Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.5727623Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.5727929Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:10:35.5728221Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:10:35.5728556Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.5728890Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:10:35.5729299Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.5729670Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.5730004Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:10:35.5730330Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:35.5730711Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:35.5731354Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:35.5731820Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:35.5732438Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:35.5733151Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:10:35.5734116Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:10:35.5735217Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:35.5735922Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:35.5736473Z U 
std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:35.5737070Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:35.5737663Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:10:35.5738188Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:35.5738705Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:35.5739169Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:35.5739544Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:35.5739923Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.5740275Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:35.5740650Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:35.5741045Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:35.5741469Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:10:35.5741866Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.5742284Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:35.5742685Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.5743501Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.5744306Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:10:35.5744623Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:35.5745016Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:35.5745437Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:10:35.5745818Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:10:35.5746261Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.5746626Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.5747287Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:35.5748027Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 
2025-05-07T20:10:35.5748579Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:35.5749201Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.5749737Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.5750206Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:35.5750566Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:35.5750911Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:35.5751457Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:35.5752176Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:35.5752631Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:35.5753023Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.5753347Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:35.5753673Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:35.5753971Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.5754290Z U strstr@GLIBC_2.2.5 2025-05-07T20:10:35.5754582Z U tolower@GLIBC_2.2.5 2025-05-07T20:10:35.5754902Z U toupper@GLIBC_2.2.5 2025-05-07T20:10:35.5764968Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:10:35.5765498Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:35.5765936Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:35.5766458Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:35.5766842Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.5767279Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.5767660Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:35.5768127Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:35.5768473Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.5768827Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.5769168Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.5769466Z w __gmon_start__ 2025-05-07T20:10:35.5769801Z 
w __pthread_key_create 2025-05-07T20:10:35.5770785Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:35.5771169Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:35.5771601Z w pthread_once 2025-05-07T20:10:35.5771926Z w pthread_rwlock_rdlock 2025-05-07T20:10:35.5772245Z w pthread_rwlock_unlock 2025-05-07T20:10:35.5772583Z w pthread_rwlock_wrlock 2025-05-07T20:10:35.5772934Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:10:35.5773311Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.5773757Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.5774023Z 2025-05-07T20:10:35.5774163Z linux-vdso.so.1 (0x00007ffe91130000) 2025-05-07T20:10:35.5774501Z libc10.so => not found 2025-05-07T20:10:35.5774765Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.5775071Z libc10_cuda.so => not found 2025-05-07T20:10:35.5775377Z libnccl.so.2 => not found 2025-05-07T20:10:35.5775645Z libcuda.so.1 => not found 2025-05-07T20:10:35.5776283Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f7f454bd000) 2025-05-07T20:10:35.5776961Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.5777273Z libtorch.so => not found 2025-05-07T20:10:35.5777548Z libtorch_cpu.so => not found 2025-05-07T20:10:35.5777863Z libtorch_cuda.so => not found 2025-05-07T20:10:35.5778210Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7f44b9c000) 2025-05-07T20:10:35.5778651Z libm.so.6 => /lib64/libm.so.6 (0x00007f7f453e0000) 2025-05-07T20:10:35.5779121Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7f453b2000) 2025-05-07T20:10:35.5779519Z libc.so.6 => /lib64/libc.so.6 (0x00007f7f44994000) 2025-05-07T20:10:35.5779920Z /lib64/ld-linux-x86-64.so.2 (0x00007f7f4553a000) 2025-05-07T20:10:35.5780263Z libtorch.so => not found 2025-05-07T20:10:35.5780559Z libc10.so => not found 2025-05-07T20:10:35.5780826Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.5781138Z libc10_cuda.so => not found 2025-05-07T20:10:35.5781415Z 
libnccl.so.2 => not found 2025-05-07T20:10:35.5781709Z libcuda.so.1 => not found 2025-05-07T20:10:35.5781992Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.5782309Z libtorch_cpu.so => not found 2025-05-07T20:10:35.5782625Z libtorch_cuda.so => not found 2025-05-07T20:10:35.5782964Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f7f4493e000) 2025-05-07T20:10:35.5783390Z librt.so.1 => /lib64/librt.so.1 (0x00007f7f453a9000) 2025-05-07T20:10:35.5783815Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f7f453a4000) 2025-05-07T20:10:35.5784130Z 2025-05-07T20:10:35.5784251Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.5784630Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:10:35.5784941Z 2025-05-07T20:10:35.5802985Z 2025-05-07T20:10:35.5803802Z Dynamic section at offset 0x54d6c8 contains 40 entries: 2025-05-07T20:10:35.5804470Z Tag Type Name/Value 2025-05-07T20:10:35.5804905Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.5805460Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.5806017Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.5806541Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.5807084Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.5807594Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:10:35.5808266Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.5808791Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.5809334Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.5809886Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.5810460Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.5811010Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:35.5811513Z 
0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.5812052Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.5812674Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:35.5813216Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:10:35.5813722Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:35.5814129Z 0x000000000000000c (INIT) 0xff000 2025-05-07T20:10:35.5814500Z 0x000000000000000d (FINI) 0x4c1c58 2025-05-07T20:10:35.5814849Z 0x0000000000000019 (INIT_ARRAY) 0x54a1c0 2025-05-07T20:10:35.5815235Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:10:35.5815590Z 0x000000000000001a (FINI_ARRAY) 0x54a688 2025-05-07T20:10:35.5815988Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.5816461Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:35.5816966Z 0x0000000000000005 (STRTAB) 0x26de0 2025-05-07T20:10:35.5817333Z 0x0000000000000006 (SYMTAB) 0x9da0 2025-05-07T20:10:35.5817752Z 0x000000000000000a (STRSZ) 754246 (bytes) 2025-05-07T20:10:35.5818150Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.5818510Z 0x0000000000000003 (PLTGOT) 0x551fe8 2025-05-07T20:10:35.5818940Z 0x0000000000000002 (PLTRELSZ) 25992 (bytes) 2025-05-07T20:10:35.5819300Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.5819664Z 0x0000000000000017 (JMPREL) 0xf8458 2025-05-07T20:10:35.5820032Z 0x0000000000000007 (RELA) 0xe1838 2025-05-07T20:10:35.5820398Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:10:35.5820802Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.5821168Z 0x000000006ffffffe (VERNEED) 0xe16d8 2025-05-07T20:10:35.5821543Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.5821886Z 0x000000006ffffff0 (VERSYM) 0xdf026 2025-05-07T20:10:35.5822252Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:10:35.5822560Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.5822776Z 2025-05-07T20:10:35.5822891Z 
################################################################################ 2025-05-07T20:10:35.5823117Z 2025-05-07T20:10:35.5823123Z 2025-05-07T20:10:35.5823252Z ################################################################################ 2025-05-07T20:10:35.5823771Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.5824251Z [CHECK] Listing out library size: 2025-05-07T20:10:35.5824695Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.5825070Z 2025-05-07T20:10:35.5825275Z 3 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.5825566Z 2025-05-07T20:10:35.5825954Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.5826887Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.5827502Z 2025-05-07T20:10:35.5880991Z GLIBC_2.2.5 2025-05-07T20:10:35.5881654Z GLIBC_2.14 2025-05-07T20:10:35.5882185Z 2025-05-07T20:10:35.5882354Z 2025-05-07T20:10:35.5882787Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.5883796Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.5884490Z 2025-05-07T20:10:35.5947639Z GLIBCXX_3.4 2025-05-07T20:10:35.5948332Z GLIBCXX_3.4.9 2025-05-07T20:10:35.5948953Z GLIBCXX_3.4.14 2025-05-07T20:10:35.5949565Z GLIBCXX_3.4.20 2025-05-07T20:10:35.5950139Z GLIBCXX_3.4.21 2025-05-07T20:10:35.5950526Z 2025-05-07T20:10:35.5950539Z 2025-05-07T20:10:35.5973878Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.TJI0TbTo9I.symbols.txt 
2025-05-07T20:10:35.5975279Z 2025-05-07T20:10:35.6006014Z 2025-05-07T20:10:35.6041102Z [CHECK] Total Number of symbols: 550 2025-05-07T20:10:35.6055722Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:10:35.6072152Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.PX8n0j8Sqm.usymbols.txt 2025-05-07T20:10:35.6072633Z 2025-05-07T20:10:35.6096457Z 2025-05-07T20:10:35.6124502Z [CHECK] Listing out undefined symbols (179 total): 2025-05-07T20:10:35.6143123Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.6145126Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.6146138Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.6146572Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.6146963Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.6147382Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:35.6147807Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:35.6148240Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:35.6148648Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.6149021Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:35.6149388Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.6149712Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.6150068Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.6150395Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:35.6150762Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.6151113Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.6151443Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.6151786Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.6152097Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:35.6152437Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.6152951Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:35.6153551Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 
2025-05-07T20:10:35.6154030Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:35.6154946Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6155856Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:10:35.6156321Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:35.6156809Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:35.6157481Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:35.6158616Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:35.6159623Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:35.6160524Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6161272Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:35.6161623Z U at::get_num_threads() 2025-05-07T20:10:35.6161937Z U at::get_thread_num() 2025-05-07T20:10:35.6162246Z U at::internal::set_thread_num(int) 2025-05-07T20:10:35.6162618Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:35.6162953Z U c10::BoolType::get() 2025-05-07T20:10:35.6163325Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.6164091Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:35.6164690Z U c10::Error::what() const 2025-05-07T20:10:35.6165064Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6165526Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6165974Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.6166327Z U c10::IntType::get() 2025-05-07T20:10:35.6166716Z U 
c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:35.6167143Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:35.6167678Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.6168180Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:35.6168546Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:35.6168958Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:35.6169394Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:35.6170046Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:35.6171057Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:35.6171509Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:35.6171921Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.6172316Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:35.6172691Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:35.6173049Z U c10::SymIntType::get() 2025-05-07T20:10:35.6173414Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:35.6173806Z U c10::TensorType::get() 2025-05-07T20:10:35.6174148Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.6175109Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:35.6176084Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:35.6176587Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:35.6177197Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:35.6178072Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:35.6178648Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:35.6179030Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:35.6179411Z U c10::cuda::GetDevice(signed char*) 
2025-05-07T20:10:35.6179795Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:35.6180173Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:35.6180647Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:35.6181136Z U c10::cuda::device_count() 2025-05-07T20:10:35.6181492Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:35.6181908Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:35.6182326Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:35.6182729Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:35.6183168Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:35.6183560Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:35.6184321Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.6185251Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.6186107Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.6187067Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.6188141Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.6189135Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:35.6189480Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:35.6189975Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:35.6190369Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:35.6190756Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:35.6191126Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:35.6191546Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:35.6191936Z U 
c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:35.6192336Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:35.6192720Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:35.6193067Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:35.6193505Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.6193941Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:35.6194505Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:35.6194883Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:35.6195276Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:35.6195666Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:35.6196013Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:35.6196504Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:35.6196933Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:35.6197307Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:35.6197692Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:35.6198082Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:35.6198443Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:35.6198837Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:35.6199849Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6201482Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6203106Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6204739Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6208105Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6209859Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6211635Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6213496Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6215274Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6217160Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6218916Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6220694Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:35.6221866Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:10:35.6222304Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:35.6222822Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:35.6223365Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6223788Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6224256Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6224649Z U int* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6225108Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:35.6225538Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6225994Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6226388Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.6226690Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.6227023Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.6227338Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:10:35.6227693Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:10:35.6228034Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.6228409Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.6229020Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.6229916Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.6230514Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.6230864Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:35.6231281Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.6231686Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.6232088Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:35.6232609Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.6233736Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.6234556Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:35.6234941Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.6235485Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.6235922Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.6236276Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.6236712Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.6237281Z 
U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.6237765Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:35.6238343Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:10:35.6239293Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:35.6240477Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:35.6241220Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.6241544Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.6241893Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.6242734Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.6243904Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.6244746Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.6245494Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.6246108Z U typeinfo for c10::Error 2025-05-07T20:10:35.6246484Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:35.6246912Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6247393Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.6247848Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.6248286Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.6248688Z U vtable for c10::Error 2025-05-07T20:10:35.6249233Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.6249897Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.6250344Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:35.6250708Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.6251034Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.6251439Z w __gmon_start__ 2025-05-07T20:10:35.6251759Z w __pthread_key_create 2025-05-07T20:10:35.6252121Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.6252684Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.6253003Z 2025-05-07T20:10:35.6253181Z linux-vdso.so.1 (0x00007ffc2d911000) 2025-05-07T20:10:35.6253498Z libc10.so => not found 2025-05-07T20:10:35.6253857Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.6254139Z libc10_cuda.so => not found 2025-05-07T20:10:35.6254442Z libnccl.so.2 => not found 2025-05-07T20:10:35.6254712Z libcuda.so.1 => not found 2025-05-07T20:10:35.6255265Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f0020a00000) 2025-05-07T20:10:35.6256205Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f00214c7000) 2025-05-07T20:10:35.6256946Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.6257248Z libtorch.so => not found 2025-05-07T20:10:35.6257521Z libtorch_cpu.so => not found 2025-05-07T20:10:35.6257836Z libtorch_cuda.so => not found 2025-05-07T20:10:35.6258119Z libcudart.so.12 => not found 2025-05-07T20:10:35.6258484Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f002079c000) 2025-05-07T20:10:35.6258908Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f002146f000) 2025-05-07T20:10:35.6259340Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0021441000) 2025-05-07T20:10:35.6259753Z libc.so.6 => /lib64/libc.so.6 (0x00007f0020594000) 2025-05-07T20:10:35.6260081Z libc10.so => not found 2025-05-07T20:10:35.6260364Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.6260633Z libc10_cuda.so => not found 2025-05-07T20:10:35.6260921Z libnccl.so.2 => not found 2025-05-07T20:10:35.6261180Z libcuda.so.1 => not found 
2025-05-07T20:10:35.6261699Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f00213c8000) 2025-05-07T20:10:35.6262249Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.6262521Z libtorch.so => not found 2025-05-07T20:10:35.6262763Z libtorch_cpu.so => not found 2025-05-07T20:10:35.6263036Z libtorch_cuda.so => not found 2025-05-07T20:10:35.6263333Z libm.so.6 => /lib64/libm.so.6 (0x00007f00204b9000) 2025-05-07T20:10:35.6263682Z /lib64/ld-linux-x86-64.so.2 (0x00007f00214d8000) 2025-05-07T20:10:35.6264051Z libtorch.so => not found 2025-05-07T20:10:35.6264293Z libc10.so => not found 2025-05-07T20:10:35.6264547Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.6264801Z libc10_cuda.so => not found 2025-05-07T20:10:35.6265069Z libnccl.so.2 => not found 2025-05-07T20:10:35.6265319Z libcuda.so.1 => not found 2025-05-07T20:10:35.6265583Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.6265881Z libtorch_cpu.so => not found 2025-05-07T20:10:35.6266155Z libtorch_cuda.so => not found 2025-05-07T20:10:35.6266498Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f00213bd000) 2025-05-07T20:10:35.6266866Z libtorch.so => not found 2025-05-07T20:10:35.6267117Z libc10.so => not found 2025-05-07T20:10:35.6267351Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.6267618Z libc10_cuda.so => not found 2025-05-07T20:10:35.6267876Z libnccl.so.2 => not found 2025-05-07T20:10:35.6268132Z libcuda.so.1 => not found 2025-05-07T20:10:35.6268383Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.6268665Z libtorch_cpu.so => not found 2025-05-07T20:10:35.6268930Z libtorch_cuda.so => not found 2025-05-07T20:10:35.6269243Z librt.so.1 => /lib64/librt.so.1 (0x00007f00213b4000) 2025-05-07T20:10:35.6269475Z 2025-05-07T20:10:35.6269597Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.6270015Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:35.6270526Z 2025-05-07T20:10:35.6270531Z 
2025-05-07T20:10:35.6270691Z Dynamic section at offset 0x2b5a90 contains 41 entries: 2025-05-07T20:10:35.6271145Z Tag Type Name/Value 2025-05-07T20:10:35.6271566Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.6272079Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.6272581Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.6273093Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.6273617Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.6274111Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:35.6274607Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:35.6275141Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.6275655Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.6276151Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.6276668Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.6277167Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:35.6277679Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.6278170Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.6278672Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.6279165Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.6279669Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:35.6280187Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:35.6280578Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:10:35.6280913Z 0x000000000000000d (FINI) 0x6243c 2025-05-07T20:10:35.6281240Z 0x0000000000000019 (INIT_ARRAY) 0x2b5a40 2025-05-07T20:10:35.6281590Z 0x000000000000001b 
(INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:10:35.6281939Z 0x000000000000001a (FINI_ARRAY) 0x2b5a88 2025-05-07T20:10:35.6282274Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.6282727Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.6283135Z 0x0000000000000005 (STRTAB) 0x40a0 2025-05-07T20:10:35.6283475Z 0x0000000000000006 (SYMTAB) 0xcf8 2025-05-07T20:10:35.6283784Z 0x000000000000000a (STRSZ) 48233 (bytes) 2025-05-07T20:10:35.6284118Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.6284426Z 0x0000000000000003 (PLTGOT) 0x2b6fe8 2025-05-07T20:10:35.6284765Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:10:35.6285122Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.6285419Z 0x0000000000000017 (JMPREL) 0x13a68 2025-05-07T20:10:35.6285753Z 0x0000000000000007 (RELA) 0x10258 2025-05-07T20:10:35.6286069Z 0x0000000000000008 (RELASZ) 14352 (bytes) 2025-05-07T20:10:35.6286415Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.6286738Z 0x000000006ffffffe (VERNEED) 0x10158 2025-05-07T20:10:35.6287054Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.6287361Z 0x000000006ffffff0 (VERSYM) 0xfd0a 2025-05-07T20:10:35.6287685Z 0x000000006ffffff9 (RELACOUNT) 337 2025-05-07T20:10:35.6287995Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.6288182Z 2025-05-07T20:10:35.6288296Z ################################################################################ 2025-05-07T20:10:35.6288512Z 2025-05-07T20:10:35.6288532Z 2025-05-07T20:10:35.6288647Z ################################################################################ 2025-05-07T20:10:35.6289109Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6289578Z [CHECK] Listing out library size: 2025-05-07T20:10:35.6290013Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6290344Z 2025-05-07T20:10:35.6290522Z 21 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6290802Z 2025-05-07T20:10:35.6291156Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6292045Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.6292570Z 2025-05-07T20:10:35.6332612Z GLIBC_2.2.5 2025-05-07T20:10:35.6332892Z GLIBC_2.14 2025-05-07T20:10:35.6333472Z 2025-05-07T20:10:35.6333517Z 2025-05-07T20:10:35.6335338Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6336487Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.6337085Z 2025-05-07T20:10:35.6415148Z GLIBCXX_3.4 2025-05-07T20:10:35.6415902Z GLIBCXX_3.4.9 2025-05-07T20:10:35.6416186Z GLIBCXX_3.4.11 2025-05-07T20:10:35.6416534Z GLIBCXX_3.4.20 2025-05-07T20:10:35.6416748Z GLIBCXX_3.4.21 2025-05-07T20:10:35.6416903Z 2025-05-07T20:10:35.6416931Z 2025-05-07T20:10:35.6436295Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.QaPeJbKMyI.symbols.txt 2025-05-07T20:10:35.6436799Z 2025-05-07T20:10:35.6479479Z 2025-05-07T20:10:35.6506203Z [CHECK] Total Number of symbols: 783 2025-05-07T20:10:35.6522471Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:10:35.6538094Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.OgiDq8Prle.usymbols.txt 2025-05-07T20:10:35.6538568Z 2025-05-07T20:10:35.6557925Z 2025-05-07T20:10:35.6593559Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:10:35.6614317Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.6616094Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:10:35.6617346Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.6618006Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.6620134Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.6620524Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:35.6620893Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:35.6621263Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:35.6621637Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.6622062Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.6622388Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.6622710Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.6623038Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.6623460Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.6623772Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.6624061Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.6624369Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.6624712Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:35.6625088Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:35.6625782Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6626904Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6628101Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6629028Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:35.6629962Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6630822Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:35.6631463Z U 
at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:35.6632344Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6633416Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.6634194Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:35.6634588Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:35.6634910Z U c10::BoolType::get() 2025-05-07T20:10:35.6635254Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.6635611Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:35.6635996Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6636405Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.6636728Z U c10::IntType::get() 2025-05-07T20:10:35.6637124Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.6637579Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.6637971Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:35.6638616Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:35.6639212Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:35.6639561Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.6639908Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.6640291Z U c10::TensorType::get() 2025-05-07T20:10:35.6640609Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.6641474Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:35.6642355Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:35.6642691Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:35.6643027Z U c10::cuda::ExchangeDevice(signed char) 
2025-05-07T20:10:35.6643354Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:35.6643673Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:35.6643998Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:35.6644429Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:35.6644878Z U c10::cuda::current_device() 2025-05-07T20:10:35.6645210Z U c10::cuda::device_count() 2025-05-07T20:10:35.6645525Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:35.6645900Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:35.6646262Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:35.6646634Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:35.6647044Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:35.6647414Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:35.6648099Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.6648893Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.6649687Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.6650548Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.6651476Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.6652219Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:35.6652541Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:35.6652876Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:35.6653278Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:35.6653647Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:35.6654007Z U 
c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:35.6654365Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:35.6654714Z U c10::throwNullDataPtrError() 2025-05-07T20:10:35.6655029Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:35.6655327Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:35.6655722Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.6656142Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:35.6656571Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:35.6657085Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:35.6657527Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:35.6657932Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:35.6658274Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:35.6658636Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:35.6658970Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:35.6659332Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:35.6659684Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:35.6660040Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:35.6660390Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:35.6660744Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:35.6661099Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:35.6661429Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:35.6661775Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:35.6662099Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:35.6662604Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:35.6663141Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:35.6663488Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:35.6663823Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:35.6664164Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:35.6664524Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:35.6664906Z U float* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6665386Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6665763Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6666122Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:35.6666497Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:35.6666916Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.6667322Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6667675Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.6667966Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.6668249Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.6668563Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.6668914Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.6669477Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.6670530Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.6671134Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.6671511Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.6671921Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.6672341Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:35.6672772Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:35.6673250Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.6674170Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.6675033Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.6675385Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.6675748Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.6676085Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.6676536Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 
2025-05-07T20:10:35.6677079Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.6677531Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.6677856Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.6678173Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.6678992Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.6680133Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.6680942Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.6681709Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.6682481Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.6682918Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.6683317Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.6683707Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.6684305Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.6684919Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.6685329Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.6685634Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.6685917Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.6686200Z w __gmon_start__ 2025-05-07T20:10:35.6686455Z w __pthread_key_create 2025-05-07T20:10:35.6686747Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:35.6687064Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:35.6687396Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.6687819Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6688102Z 2025-05-07T20:10:35.6688226Z linux-vdso.so.1 
(0x00007ffc0a1e6000) 2025-05-07T20:10:35.6688506Z libtorch.so => not found 2025-05-07T20:10:35.6688746Z libc10.so => not found 2025-05-07T20:10:35.6688975Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.6689228Z libc10_cuda.so => not found 2025-05-07T20:10:35.6689469Z libnccl.so.2 => not found 2025-05-07T20:10:35.6689716Z libcuda.so.1 => not found 2025-05-07T20:10:35.6689959Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.6690224Z libtorch_cpu.so => not found 2025-05-07T20:10:35.6690469Z libtorch_cuda.so => not found 2025-05-07T20:10:35.6690728Z libcudart.so.12 => not found 2025-05-07T20:10:35.6691029Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2990d9c000) 2025-05-07T20:10:35.6691396Z libm.so.6 => /lib64/libm.so.6 (0x00007f2992728000) 2025-05-07T20:10:35.6691756Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f29926d2000) 2025-05-07T20:10:35.6692120Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2990d6e000) 2025-05-07T20:10:35.6692510Z libc.so.6 => /lib64/libc.so.6 (0x00007f2990b66000) 2025-05-07T20:10:35.6692847Z /lib64/ld-linux-x86-64.so.2 (0x00007f299280b000) 2025-05-07T20:10:35.6693072Z 2025-05-07T20:10:35.6693172Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.6693560Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:35.6693878Z 2025-05-07T20:10:35.6698733Z 2025-05-07T20:10:35.6698954Z Dynamic section at offset 0x14b76f0 contains 39 entries: 2025-05-07T20:10:35.6699348Z Tag Type Name/Value 2025-05-07T20:10:35.6699767Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.6700268Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.6700763Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.6701281Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.6701791Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.6702289Z 0x0000000000000001 (NEEDED) Shared library: 
[libcuda.so.1] 2025-05-07T20:10:35.6702820Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.6703331Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.6703850Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.6704362Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:35.6704893Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.6705393Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:35.6705869Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.6706373Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.6706849Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.6707407Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:35.6707861Z 0x000000000000000c (INIT) 0x2d000 2025-05-07T20:10:35.6708183Z 0x000000000000000d (FINI) 0xd6d2c 2025-05-07T20:10:35.6708522Z 0x0000000000000019 (INIT_ARRAY) 0x14b5318 2025-05-07T20:10:35.6708868Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:10:35.6709219Z 0x000000000000001a (FINI_ARRAY) 0x14b53e8 2025-05-07T20:10:35.6709547Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.6709878Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.6710209Z 0x0000000000000005 (STRTAB) 0x5fa8 2025-05-07T20:10:35.6710519Z 0x0000000000000006 (SYMTAB) 0x1628 2025-05-07T20:10:35.6710867Z 0x000000000000000a (STRSZ) 113302 (bytes) 2025-05-07T20:10:35.6711218Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.6711562Z 0x0000000000000003 (PLTGOT) 0x14b7fe8 2025-05-07T20:10:35.6711908Z 0x0000000000000002 (PLTRELSZ) 10368 (bytes) 2025-05-07T20:10:35.6712250Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.6712562Z 0x0000000000000017 (JMPREL) 0x29e58 2025-05-07T20:10:35.6712888Z 0x0000000000000007 (RELA) 0x22160 
2025-05-07T20:10:35.6713233Z 0x0000000000000008 (RELASZ) 31992 (bytes) 2025-05-07T20:10:35.6713683Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.6714021Z 0x000000006ffffffe (VERNEED) 0x22060 2025-05-07T20:10:35.6714332Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.6714836Z 0x000000006ffffff0 (VERSYM) 0x21a3e 2025-05-07T20:10:35.6715155Z 0x000000006ffffff9 (RELACOUNT) 498 2025-05-07T20:10:35.6715461Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.6715656Z 2025-05-07T20:10:35.6715780Z ################################################################################ 2025-05-07T20:10:35.6716033Z 2025-05-07T20:10:35.6716037Z 2025-05-07T20:10:35.6716149Z ################################################################################ 2025-05-07T20:10:35.6716730Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.6717210Z [CHECK] Listing out library size: 2025-05-07T20:10:35.6717697Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.6718064Z 2025-05-07T20:10:35.6718360Z 9 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.6719553Z 2025-05-07T20:10:35.6721091Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.6724110Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.6725446Z 2025-05-07T20:10:35.6777272Z GLIBC_2.2.5 2025-05-07T20:10:35.6777890Z GLIBC_2.3 2025-05-07T20:10:35.6778447Z GLIBC_2.14 2025-05-07T20:10:35.6778787Z 2025-05-07T20:10:35.6778801Z 2025-05-07T20:10:35.6780064Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.6781672Z + objdump -TC 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.6782672Z 2025-05-07T20:10:35.6842300Z GLIBCXX_3.4 2025-05-07T20:10:35.6842546Z GLIBCXX_3.4.9 2025-05-07T20:10:35.6842798Z GLIBCXX_3.4.11 2025-05-07T20:10:35.6843016Z GLIBCXX_3.4.18 2025-05-07T20:10:35.6843217Z GLIBCXX_3.4.21 2025-05-07T20:10:35.6843357Z 2025-05-07T20:10:35.6843365Z 2025-05-07T20:10:35.6862284Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.PdwbBMSgNw.symbols.txt 2025-05-07T20:10:35.6862780Z 2025-05-07T20:10:35.6894386Z 2025-05-07T20:10:35.6922253Z [CHECK] Total Number of symbols: 347 2025-05-07T20:10:35.6934594Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:10:35.6953320Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.0dWpLOgl55.usymbols.txt 2025-05-07T20:10:35.6954996Z 2025-05-07T20:10:35.6972538Z 2025-05-07T20:10:35.7003622Z [CHECK] Listing out undefined symbols (124 total): 2025-05-07T20:10:35.7027256Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7029020Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7029557Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.7029896Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7030289Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7030668Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7031036Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:35.7031390Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:35.7031743Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:35.7032099Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7032434Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.7032746Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.7033053Z U 
__cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.7033364Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.7033668Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:35.7033996Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.7034304Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:35.7034777Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:35.7035180Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:35.7035635Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:35.7036100Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:35.7036463Z U c10::BoolType::get() 2025-05-07T20:10:35.7036980Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.7037347Z U c10::FloatType::get() 2025-05-07T20:10:35.7037656Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:35.7038047Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7038457Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.7038801Z U c10::IntType::get() 2025-05-07T20:10:35.7039246Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:35.7039629Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:35.7039977Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:35.7040339Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:35.7040951Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:35.7041539Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:35.7041926Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.7042233Z U c10::TensorType::get() 2025-05-07T20:10:35.7042530Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.7043397Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:35.7044327Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:35.7044671Z U 
c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:35.7045003Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:35.7045312Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:35.7045634Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:35.7045939Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:35.7046383Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:35.7046814Z U c10::cuda::device_count() 2025-05-07T20:10:35.7047126Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:35.7047484Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:35.7047831Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:35.7048199Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:35.7048566Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:35.7048926Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:35.7049606Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.7050397Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.7051183Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.7052031Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.7053128Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.7053862Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:35.7054171Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:35.7054498Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:35.7054842Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:35.7055194Z U c10::operator<<(std::ostream&, c10::DeviceType) 
2025-05-07T20:10:35.7055518Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:35.7055880Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.7056285Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:35.7056727Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:35.7057243Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:35.7057592Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:35.7057925Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:35.7058259Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:35.7058596Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:35.7058970Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:35.7059341Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:35.7059703Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:35.7060042Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:35.7060368Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:35.7060720Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:35.7061062Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:35.7061413Z U float at::Tensor::item() const 2025-05-07T20:10:35.7061827Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7062233Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7062637Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7062985Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.7063272Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.7063549Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.7063861Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.7064194Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.7064763Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.7065587Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.7066389Z U std::__cxx11::basic_stringstream, std::allocator 
>::basic_stringstream() 2025-05-07T20:10:35.7067182Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:35.7067774Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.7068104Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:35.7068470Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7068849Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7069235Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:35.7069711Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.7070828Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7071941Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7072437Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7072834Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.7073533Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.7074016Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7074560Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7075029Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:35.7075383Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.7075701Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.7076013Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.7076835Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.7077975Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7078817Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7079547Z U torch::jit::parseSchema(std::__cxx11::basic_string, 
std::allocator > const&, bool) 2025-05-07T20:10:35.7080158Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7080585Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.7081021Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7081648Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7082313Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.7082757Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.7083097Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.7083421Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.7083821Z w __gmon_start__ 2025-05-07T20:10:35.7084096Z w __pthread_key_create 2025-05-07T20:10:35.7084388Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:35.7084715Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:35.7085157Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.7085370Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.7085379Z 2025-05-07T20:10:35.7085497Z linux-vdso.so.1 (0x00007fffc3eaf000) 2025-05-07T20:10:35.7085586Z libtorch.so => not found 2025-05-07T20:10:35.7085682Z libc10.so => not found 2025-05-07T20:10:35.7085773Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.7085882Z libc10_cuda.so => not found 2025-05-07T20:10:35.7085979Z libnccl.so.2 => not found 2025-05-07T20:10:35.7086104Z libcuda.so.1 => not found 2025-05-07T20:10:35.7086208Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.7086309Z libtorch_cpu.so => not found 2025-05-07T20:10:35.7086433Z libtorch_cuda.so => not found 2025-05-07T20:10:35.7086530Z libcudart.so.12 => not found 2025-05-07T20:10:35.7086693Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f93cc39c000) 2025-05-07T20:10:35.7086874Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f93ccfc5000) 2025-05-07T20:10:35.7087024Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f93ccf97000) 
2025-05-07T20:10:35.7087149Z libc.so.6 => /lib64/libc.so.6 (0x00007f93cc194000) 2025-05-07T20:10:35.7087306Z /lib64/ld-linux-x86-64.so.2 (0x00007f93cd023000) 2025-05-07T20:10:35.7087453Z libm.so.6 => /lib64/libm.so.6 (0x00007f93cc0b9000) 2025-05-07T20:10:35.7087458Z 2025-05-07T20:10:35.7087568Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.7087804Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:35.7087839Z 2025-05-07T20:10:35.7103286Z 2025-05-07T20:10:35.7104604Z Dynamic section at offset 0x8a7a10 contains 39 entries: 2025-05-07T20:10:35.7105198Z Tag Type Name/Value 2025-05-07T20:10:35.7106190Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.7106568Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.7106845Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.7107053Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.7107286Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.7107486Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.7107704Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.7107943Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.7108158Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.7108431Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:35.7108660Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.7108868Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.7109074Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.7109293Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.7109542Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 
2025-05-07T20:10:35.7109784Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:35.7109910Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:35.7110059Z 0x000000000000000d (FINI) 0x333cc 2025-05-07T20:10:35.7110188Z 0x0000000000000019 (INIT_ARRAY) 0x8a71f8 2025-05-07T20:10:35.7110323Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:10:35.7110477Z 0x000000000000001a (FINI_ARRAY) 0x8a7228 2025-05-07T20:10:35.7110610Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.7110734Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:35.7110882Z 0x0000000000000005 (STRTAB) 0x2a78 2025-05-07T20:10:35.7111000Z 0x0000000000000006 (SYMTAB) 0x9d8 2025-05-07T20:10:35.7111144Z 0x000000000000000a (STRSZ) 38407 (bytes) 2025-05-07T20:10:35.7111271Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.7111423Z 0x0000000000000003 (PLTGOT) 0x8a7fe8 2025-05-07T20:10:35.7111564Z 0x0000000000000002 (PLTRELSZ) 4728 (bytes) 2025-05-07T20:10:35.7111677Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.7111819Z 0x0000000000000017 (JMPREL) 0xe230 2025-05-07T20:10:35.7111936Z 0x0000000000000007 (RELA) 0xc448 2025-05-07T20:10:35.7112070Z 0x0000000000000008 (RELASZ) 7656 (bytes) 2025-05-07T20:10:35.7112197Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.7112342Z 0x000000006ffffffe (VERNEED) 0xc338 2025-05-07T20:10:35.7112454Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.7112574Z 0x000000006ffffff0 (VERSYM) 0xc080 2025-05-07T20:10:35.7112708Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:10:35.7112814Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.7112861Z 2025-05-07T20:10:35.7112983Z ################################################################################ 2025-05-07T20:10:35.7112987Z 2025-05-07T20:10:35.7112991Z 2025-05-07T20:10:35.7113117Z ################################################################################ 2025-05-07T20:10:35.7113376Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7113513Z [CHECK] Listing out library size: 2025-05-07T20:10:35.7113796Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7113803Z 2025-05-07T20:10:35.7117011Z 17 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7117910Z 2025-05-07T20:10:35.7119261Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7119923Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.7119934Z 2025-05-07T20:10:35.7177323Z GLIBC_2.2.5 2025-05-07T20:10:35.7177653Z GLIBC_2.14 2025-05-07T20:10:35.7178935Z 2025-05-07T20:10:35.7178970Z 2025-05-07T20:10:35.7179604Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7180317Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.7180334Z 2025-05-07T20:10:35.7242038Z GLIBCXX_3.4 2025-05-07T20:10:35.7242514Z GLIBCXX_3.4.9 2025-05-07T20:10:35.7242979Z GLIBCXX_3.4.20 2025-05-07T20:10:35.7243448Z GLIBCXX_3.4.21 2025-05-07T20:10:35.7243499Z 2025-05-07T20:10:35.7243519Z 2025-05-07T20:10:35.7261558Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.PmhZgyN8rH.symbols.txt 2025-05-07T20:10:35.7261609Z 2025-05-07T20:10:35.7289285Z 2025-05-07T20:10:35.7333327Z [CHECK] Total Number of symbols: 452 2025-05-07T20:10:35.7354004Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:10:35.7371813Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.CldFPTpYxo.usymbols.txt 2025-05-07T20:10:35.7371852Z 
2025-05-07T20:10:35.7384077Z 2025-05-07T20:10:35.7414053Z [CHECK] Listing out undefined symbols (149 total): 2025-05-07T20:10:35.7432379Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7433018Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.7433591Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7434010Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7434388Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7434911Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:35.7435048Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:35.7435179Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:35.7435347Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7435474Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:35.7435585Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.7435706Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.7435831Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.7435945Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.7436060Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.7436178Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.7436283Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.7436380Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:35.7436516Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.7436894Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:35.7437069Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:35.7437706Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7438411Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7438585Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 
2025-05-07T20:10:35.7438768Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:35.7438979Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:35.7439210Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:35.7439336Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:35.7439839Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7440447Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7440559Z U c10::BoolType::get() 2025-05-07T20:10:35.7440753Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.7440902Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.7441011Z U c10::IntType::get() 2025-05-07T20:10:35.7441295Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:35.7441427Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:35.7441657Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.7441857Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.7442120Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:35.7442519Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:35.7442687Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:35.7442815Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.7442931Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:35.7443063Z U c10::SymIntType::get() 2025-05-07T20:10:35.7443222Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:35.7443385Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.7443623Z U c10::TensorType::get() 2025-05-07T20:10:35.7443748Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.7444397Z U 
c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:35.7444555Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:35.7444675Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:35.7444795Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:35.7444928Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:35.7445051Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:35.7445189Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:35.7445424Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:35.7445553Z U c10::cuda::current_device() 2025-05-07T20:10:35.7445655Z U c10::cuda::device_count() 2025-05-07T20:10:35.7445792Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:35.7445962Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:35.7446103Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:35.7446243Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:35.7446418Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:35.7446530Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:35.7447004Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.7447265Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.7447717Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.7448034Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.7448609Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.7448730Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:35.7448861Z U 
c10::impl::GPUTrace::haveState 2025-05-07T20:10:35.7449010Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:35.7449191Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:35.7449313Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:35.7449475Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:35.7449611Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:35.7449727Z U c10::throwNullDataPtrError() 2025-05-07T20:10:35.7449851Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:35.7449967Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:35.7450155Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.7450294Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:35.7450421Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:35.7450546Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:35.7450702Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:35.7450818Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:35.7450939Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:35.7451051Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:35.7451187Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:35.7451311Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:35.7451438Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:35.7451576Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:35.7451695Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:35.7451831Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:35.7451984Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:35.7452096Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:35.7452237Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:35.7452353Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:35.7452483Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:35.7452742Z U 
cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:35.7452857Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:35.7452996Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:35.7453102Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:35.7453220Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:35.7453342Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:35.7453461Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7453594Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7453681Z U log2@GLIBC_2.2.5 2025-05-07T20:10:35.7453866Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:35.7466669Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7466903Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:35.7467005Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.7467123Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.7467229Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.7467349Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.7467563Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.7467913Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.7468297Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.7468438Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.7468648Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7468849Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7469126Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:35.7469609Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.7470516Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7470670Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:35.7470793Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7470917Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7471055Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.7471174Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.7471361Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7471616Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7471749Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:35.7471862Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.7471962Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.7472102Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.7472681Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.7473252Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7473850Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7474273Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.7474460Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:35.7474617Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7474783Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.7474959Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7475287Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7475515Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.7475647Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.7475755Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.7475861Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.7475955Z w __gmon_start__ 2025-05-07T20:10:35.7476068Z w 
__pthread_key_create 2025-05-07T20:10:35.7476218Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.7476454Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7476461Z 2025-05-07T20:10:35.7476611Z linux-vdso.so.1 (0x00007ffdbc569000) 2025-05-07T20:10:35.7476709Z libtorch.so => not found 2025-05-07T20:10:35.7476796Z libc10.so => not found 2025-05-07T20:10:35.7476911Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.7477003Z libc10_cuda.so => not found 2025-05-07T20:10:35.7477094Z libnccl.so.2 => not found 2025-05-07T20:10:35.7477201Z libcuda.so.1 => not found 2025-05-07T20:10:35.7477340Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.7477436Z libtorch_cpu.so => not found 2025-05-07T20:10:35.7477530Z libtorch_cuda.so => not found 2025-05-07T20:10:35.7477637Z libcudart.so.12 => not found 2025-05-07T20:10:35.7477795Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f5d0119c000) 2025-05-07T20:10:35.7477919Z libm.so.6 => /lib64/libm.so.6 (0x00007f5d010c1000) 2025-05-07T20:10:35.7478085Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f5d02517000) 2025-05-07T20:10:35.7478233Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5d01093000) 2025-05-07T20:10:35.7478354Z libc.so.6 => /lib64/libc.so.6 (0x00007f5d00e8b000) 2025-05-07T20:10:35.7478478Z /lib64/ld-linux-x86-64.so.2 (0x00007f5d02575000) 2025-05-07T20:10:35.7478498Z 2025-05-07T20:10:35.7478604Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.7478826Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:35.7478831Z 2025-05-07T20:10:35.7503410Z 2025-05-07T20:10:35.7504774Z Dynamic section at offset 0x104fa28 contains 39 entries: 2025-05-07T20:10:35.7505410Z Tag Type Name/Value 2025-05-07T20:10:35.7506540Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.7507249Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.7507830Z 0x0000000000000001 (NEEDED) Shared 
library: [libnvrtc.so.12] 2025-05-07T20:10:35.7508399Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.7508820Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.7509013Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.7509218Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.7509434Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.7509743Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.7509943Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:35.7510156Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.7510342Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:35.7510572Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:35.7510780Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.7510970Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.7511187Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:35.7511301Z 0x000000000000000c (INIT) 0x11000 2025-05-07T20:10:35.7511427Z 0x000000000000000d (FINI) 0x8746c 2025-05-07T20:10:35.7511546Z 0x0000000000000019 (INIT_ARRAY) 0x104ff20 2025-05-07T20:10:35.7511675Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:10:35.7511808Z 0x000000000000001a (FINI_ARRAY) 0x104ff80 2025-05-07T20:10:35.7511928Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.7512042Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.7512167Z 0x0000000000000005 (STRTAB) 0x3660 2025-05-07T20:10:35.7512279Z 0x0000000000000006 (SYMTAB) 0xbe8 2025-05-07T20:10:35.7512410Z 0x000000000000000a (STRSZ) 35790 (bytes) 2025-05-07T20:10:35.7512558Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.7512690Z 0x0000000000000003 (PLTGOT) 
0x1050fe8 2025-05-07T20:10:35.7512821Z 0x0000000000000002 (PLTRELSZ) 6480 (bytes) 2025-05-07T20:10:35.7512928Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.7513053Z 0x0000000000000017 (JMPREL) 0xf060 2025-05-07T20:10:35.7513162Z 0x0000000000000007 (RELA) 0xc6a8 2025-05-07T20:10:35.7513321Z 0x0000000000000008 (RELASZ) 10680 (bytes) 2025-05-07T20:10:35.7513440Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.7513568Z 0x000000006ffffffe (VERNEED) 0xc5b8 2025-05-07T20:10:35.7513675Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:35.7513784Z 0x000000006ffffff0 (VERSYM) 0xc22e 2025-05-07T20:10:35.7513905Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:10:35.7514003Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.7514025Z 2025-05-07T20:10:35.7514148Z ################################################################################ 2025-05-07T20:10:35.7514154Z 2025-05-07T20:10:35.7514157Z 2025-05-07T20:10:35.7514286Z ################################################################################ 2025-05-07T20:10:35.7514590Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7514698Z [CHECK] Listing out library size: 2025-05-07T20:10:35.7515013Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7515017Z 2025-05-07T20:10:35.7516106Z 2 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7517050Z 2025-05-07T20:10:35.7518421Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7519031Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.7519037Z 2025-05-07T20:10:35.7571839Z GLIBC_2.2.5 2025-05-07T20:10:35.7572325Z GLIBC_2.14 
2025-05-07T20:10:35.7572355Z 2025-05-07T20:10:35.7572378Z 2025-05-07T20:10:35.7574397Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7576285Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.7576302Z 2025-05-07T20:10:35.7621238Z GLIBCXX_3.4 2025-05-07T20:10:35.7621700Z GLIBCXX_3.4.9 2025-05-07T20:10:35.7622127Z GLIBCXX_3.4.21 2025-05-07T20:10:35.7622277Z 2025-05-07T20:10:35.7622315Z 2025-05-07T20:10:35.7645018Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.epNsG0iirV.symbols.txt 2025-05-07T20:10:35.7645046Z 2025-05-07T20:10:35.7660944Z 2025-05-07T20:10:35.7688286Z [CHECK] Total Number of symbols: 277 2025-05-07T20:10:35.7701832Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:10:35.7724673Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.xuNQaxRSgO.usymbols.txt 2025-05-07T20:10:35.7724701Z 2025-05-07T20:10:35.7748560Z 2025-05-07T20:10:35.7775640Z [CHECK] Listing out undefined symbols (127 total): 2025-05-07T20:10:35.7793067Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7793999Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.7794688Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7794847Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:35.7795007Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7795193Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:35.7795594Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:35.7795720Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:35.7795879Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:35.7795986Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:10:35.7796101Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.7796239Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.7796404Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.7796509Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.7796616Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.7796847Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:35.7797400Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7798023Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7798197Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:35.7798314Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:35.7798775Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7798877Z U at::get_thread_num() 2025-05-07T20:10:35.7798996Z U at::internal::set_thread_num(int) 2025-05-07T20:10:35.7799546Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.7799803Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:35.7799980Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7800158Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.7800369Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7800512Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.7800668Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:35.7800789Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:35.7800944Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.7801162Z U c10::SymBool::guard_bool(char const*, long) const 
2025-05-07T20:10:35.7801282Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.7801435Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:35.7801610Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:35.7801718Z U c10::TensorType::get() 2025-05-07T20:10:35.7801843Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.7802523Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:35.7802657Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:35.7802780Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:35.7802930Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:35.7803046Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:35.7803198Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:35.7803316Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:35.7803575Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:35.7803684Z U c10::cuda::device_count() 2025-05-07T20:10:35.7803824Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:35.7804009Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:35.7804153Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:35.7804292Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:35.7804474Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:35.7804588Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:35.7805060Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.7805304Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.7805751Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.7806074Z U c10::detail::torchInternalAssertFail(char 
const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.7806185Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:35.7806290Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:35.7806429Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:35.7806600Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:35.7806715Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:35.7806847Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:35.7806991Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:35.7807099Z U c10::throwNullDataPtrError() 2025-05-07T20:10:35.7807199Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:35.7807316Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:35.7807529Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.7807642Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:35.7807778Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:35.7807903Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:35.7808079Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:35.7808198Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:35.7808321Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:35.7808450Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:35.7808564Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:35.7808685Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:35.7808826Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:35.7808932Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:35.7809050Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:35.7809167Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:35.7809299Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:35.7809410Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:35.7809674Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:35.7809815Z U 
cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:35.7810111Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:35.7810234Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:35.7810432Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:35.7810550Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:35.7810696Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7810846Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7811044Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:35.7811182Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:35.7811283Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.7811403Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.7811501Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.7811611Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.7811762Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.7812096Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.7812476Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.7812617Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.7812764Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7812910Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.7813173Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.7813730Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7813859Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7814007Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.7814133Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.7814248Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.7814448Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.7814592Z U std::terminate()@GLIBCXX_3.4 
2025-05-07T20:10:35.7814693Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.7814836Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.7815404Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.7815881Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7816158Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.7816608Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.7816765Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7817121Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.7817289Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.7817617Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.7817927Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.7818046Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.7818188Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.7818323Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.7818421Z w __gmon_start__ 2025-05-07T20:10:35.7818525Z w __pthread_key_create 2025-05-07T20:10:35.7818699Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.7818950Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7818979Z 2025-05-07T20:10:35.7839566Z linux-vdso.so.1 (0x00007ffd47763000) 2025-05-07T20:10:35.7839728Z libc10.so => not found 2025-05-07T20:10:35.7839849Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.7839951Z libc10_cuda.so => not found 2025-05-07T20:10:35.7840120Z libnccl.so.2 => not found 2025-05-07T20:10:35.7841221Z libcuda.so.1 => not found 2025-05-07T20:10:35.7842707Z fbgemm_gpu_tbe_utils.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fcfe3400000) 2025-05-07T20:10:35.7843031Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.7843315Z libtorch.so => not found 2025-05-07T20:10:35.7843592Z libtorch_cpu.so => not found 2025-05-07T20:10:35.7843976Z libtorch_cuda.so => not found 2025-05-07T20:10:35.7844266Z libcudart.so.12 => not found 2025-05-07T20:10:35.7844747Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fcfe319c000) 2025-05-07T20:10:35.7845205Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fcfe468c000) 2025-05-07T20:10:35.7845653Z libc.so.6 => /lib64/libc.so.6 (0x00007fcfe2f94000) 2025-05-07T20:10:35.7845915Z libtorch.so => not found 2025-05-07T20:10:35.7846156Z libc10.so => not found 2025-05-07T20:10:35.7846422Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.7846688Z libc10_cuda.so => not found 2025-05-07T20:10:35.7846942Z libnccl.so.2 => not found 2025-05-07T20:10:35.7847233Z libcuda.so.1 => not found 2025-05-07T20:10:35.7847345Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.7847440Z libtorch_cpu.so => not found 2025-05-07T20:10:35.7847540Z libtorch_cuda.so => not found 2025-05-07T20:10:35.7847633Z libcudart.so.12 => not found 2025-05-07T20:10:35.7847770Z libm.so.6 => /lib64/libm.so.6 (0x00007fcfe45ad000) 2025-05-07T20:10:35.7847921Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fcfe4557000) 2025-05-07T20:10:35.7848049Z /lib64/ld-linux-x86-64.so.2 (0x00007fcfe4869000) 2025-05-07T20:10:35.7848061Z 2025-05-07T20:10:35.7850417Z [CHECK] Displaying ELF information: 2025-05-07T20:10:35.7850691Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:35.7850697Z 2025-05-07T20:10:35.7876835Z 2025-05-07T20:10:35.7877658Z Dynamic section at offset 0x16eba8 contains 39 entries: 2025-05-07T20:10:35.7878019Z Tag Type Name/Value 2025-05-07T20:10:35.7878590Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.7879473Z 
0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.7880064Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.7880626Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.7881180Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.7881827Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:35.7882428Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.7882990Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.7883579Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.7884151Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.7884724Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:35.7885305Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.7885974Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.7886518Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.7887230Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:35.7887761Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:35.7888089Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:10:35.7888500Z 0x000000000000000d (FINI) 0x1a14c 2025-05-07T20:10:35.7888853Z 0x0000000000000019 (INIT_ARRAY) 0x16f890 2025-05-07T20:10:35.7889197Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:10:35.7889523Z 0x000000000000001a (FINI_ARRAY) 0x16f8b0 2025-05-07T20:10:35.7889889Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.7890212Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.7890530Z 0x0000000000000005 (STRTAB) 0x2108 2025-05-07T20:10:35.7890873Z 0x0000000000000006 (SYMTAB) 0x6f8 2025-05-07T20:10:35.7891013Z 0x000000000000000a (STRSZ) 20443 
(bytes) 2025-05-07T20:10:35.7891128Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.7891242Z 0x0000000000000003 (PLTGOT) 0x16ffe8 2025-05-07T20:10:35.7891384Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:10:35.7891490Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.7891599Z 0x0000000000000017 (JMPREL) 0x8150 2025-05-07T20:10:35.7891701Z 0x0000000000000007 (RELA) 0x73d0 2025-05-07T20:10:35.7891838Z 0x0000000000000008 (RELASZ) 3456 (bytes) 2025-05-07T20:10:35.7891966Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.7892079Z 0x000000006ffffffe (VERNEED) 0x7310 2025-05-07T20:10:35.7892197Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:35.7892309Z 0x000000006ffffff0 (VERSYM) 0x70e4 2025-05-07T20:10:35.7892412Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:10:35.7892523Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.7892527Z 2025-05-07T20:10:35.7892643Z ################################################################################ 2025-05-07T20:10:35.7892648Z 2025-05-07T20:10:35.7892652Z 2025-05-07T20:10:35.7892763Z ################################################################################ 2025-05-07T20:10:35.7893144Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.7893247Z [CHECK] Listing out library size: 2025-05-07T20:10:35.7893554Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.7893558Z 2025-05-07T20:10:35.7906684Z 11 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.7908470Z 2025-05-07T20:10:35.7908936Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.7909487Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 
's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.7909493Z 2025-05-07T20:10:35.8372966Z GLIBC_2.2.5 2025-05-07T20:10:35.8373216Z GLIBC_2.3 2025-05-07T20:10:35.8373464Z GLIBC_2.14 2025-05-07T20:10:35.8377131Z 2025-05-07T20:10:35.8377141Z 2025-05-07T20:10:35.8377632Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.8378191Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.8378200Z 2025-05-07T20:10:35.8846030Z GLIBCXX_3.4 2025-05-07T20:10:35.8846259Z GLIBCXX_3.4.9 2025-05-07T20:10:35.8846809Z GLIBCXX_3.4.11 2025-05-07T20:10:35.8846919Z GLIBCXX_3.4.15 2025-05-07T20:10:35.8847017Z GLIBCXX_3.4.18 2025-05-07T20:10:35.8847125Z GLIBCXX_3.4.20 2025-05-07T20:10:35.8847210Z GLIBCXX_3.4.21 2025-05-07T20:10:35.8847217Z 2025-05-07T20:10:35.8847223Z 2025-05-07T20:10:35.8863727Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.FxnZZJj0qa.symbols.txt 2025-05-07T20:10:35.8863743Z 2025-05-07T20:10:35.9271227Z 2025-05-07T20:10:35.9298473Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:10:35.9327614Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:35.9341788Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.wnoVu2kUeS.usymbols.txt 2025-05-07T20:10:35.9341799Z 2025-05-07T20:10:35.9384514Z 2025-05-07T20:10:35.9419241Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:10:35.9441103Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.9441511Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.9441667Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:35.9441795Z U __cxa_allocate_exception@CXXABI_1.3 
2025-05-07T20:10:35.9441902Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:35.9442032Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:35.9442260Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:35.9442371Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:35.9442488Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:35.9442610Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:35.9442718Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:35.9442818Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:35.9442929Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:35.9443043Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:35.9443141Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:35.9443333Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:35.9443466Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:35.9443574Z U at::RecordFunction::end() 2025-05-07T20:10:35.9443834Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:35.9444002Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:35.9444303Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:35.9444601Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:35.9444984Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:35.9445192Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:35.9445843Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:35.9446034Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:35.9446200Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:35.9446346Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:35.9446505Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:35.9446637Z U 
at::sequence_number::get_and_increment() 2025-05-07T20:10:35.9446747Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:35.9446885Z U c10::AnyType::get() 2025-05-07T20:10:35.9446984Z U c10::BoolType::get() 2025-05-07T20:10:35.9447147Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:35.9447342Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:35.9447459Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:35.9449132Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:35.9449758Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:35.9450118Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:35.9450239Z U c10::Error::what() const 2025-05-07T20:10:35.9450338Z U c10::FloatType::get() 2025-05-07T20:10:35.9450443Z U c10::GradMode::is_enabled() 2025-05-07T20:10:35.9450552Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:35.9450821Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:35.9450934Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:35.9451042Z U c10::IValue::isBoolList() const 2025-05-07T20:10:35.9451162Z U c10::IValue::isDoubleList() const 2025-05-07T20:10:35.9451264Z U c10::IValue::isIntList() const 2025-05-07T20:10:35.9451371Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:35.9451487Z U c10::IValue::isTensorList() const 2025-05-07T20:10:35.9451621Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:35.9451716Z U c10::IntType::get() 2025-05-07T20:10:35.9452166Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.9452324Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:35.9452444Z U c10::MessageLogger::~MessageLogger() 
2025-05-07T20:10:35.9452578Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:35.9452728Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:35.9452932Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.9453206Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:35.9453304Z U c10::StringType::get() 2025-05-07T20:10:35.9453490Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:35.9453642Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:35.9453783Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:35.9453926Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:35.9454311Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:35.9454444Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:35.9454565Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:35.9454709Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:35.9454821Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:35.9454943Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:35.9455051Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:35.9455164Z U c10::SymIntType::get() 2025-05-07T20:10:35.9455302Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:35.9455397Z U c10::TensorType::get() 2025-05-07T20:10:35.9455531Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:35.9455953Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:35.9456574Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:35.9456987Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:35.9457495Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.9457858Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:35.9458427Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:35.9458759Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:35.9458946Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:35.9459070Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:35.9459244Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:35.9459612Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:35.9459741Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:35.9459920Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:35.9460069Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:35.9460214Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:35.9460429Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:35.9460583Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:35.9460839Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:35.9461140Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:35.9461239Z U free@GLIBC_2.2.5 2025-05-07T20:10:35.9461440Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:35.9461539Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:35.9461652Z U memcpy@GLIBC_2.14 2025-05-07T20:10:35.9461751Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:35.9461845Z U memset@GLIBC_2.2.5 2025-05-07T20:10:35.9461978Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:35.9462103Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:35.9462201Z U 
realloc@GLIBC_2.2.5 2025-05-07T20:10:35.9462431Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:35.9462768Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:35.9463234Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.9463547Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:35.9463915Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:35.9464255Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:35.9464382Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:35.9464493Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:35.9464650Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.9464799Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:35.9464961Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:35.9465089Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:35.9465234Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:35.9465457Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:35.9465981Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.9466114Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:35.9466234Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:35.9466346Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:35.9466471Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:35.9466578Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:35.9466747Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.9466981Z U std::ostream& 
std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:35.9467101Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:35.9467257Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:35.9467396Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:35.9467784Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:35.9467941Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:35.9468061Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:35.9468153Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:35.9468241Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:35.9468373Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:35.9468933Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:35.9469355Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.9469607Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:35.9469723Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:35.9469995Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:35.9470327Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:35.9470698Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:35.9470970Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:35.9471411Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:35.9471566Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:35.9471756Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:35.9471951Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 
2025-05-07T20:10:35.9472078Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:35.9472225Z U torch::autograd::Node::metadata() 2025-05-07T20:10:35.9472382Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:35.9472627Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:35.9472899Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:35.9473056Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:35.9473269Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:35.9473485Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:35.9476097Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:35.9476271Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:35.9476424Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:35.9476587Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:35.9477486Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:35.9477633Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:35.9478009Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:35.9478387Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:35.9478488Z U typeinfo for c10::Error 2025-05-07T20:10:35.9478621Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:35.9478751Z U typeinfo for 
std::exception@GLIBCXX_3.4 2025-05-07T20:10:35.9478873Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:35.9479002Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:35.9479128Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:35.9479269Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:35.9479421Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:35.9479586Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:35.9479760Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:35.9479862Z U vtable for c10::Error 2025-05-07T20:10:35.9480188Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:35.9480313Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:35.9480523Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:35.9480655Z U vtable for torch::autograd::Node 2025-05-07T20:10:35.9480832Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:35.9480938Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:35.9481037Z w _ITM_registerTMCloneTable 2025-05-07T20:10:35.9481149Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:35.9481236Z w __gmon_start__ 2025-05-07T20:10:35.9481327Z w __pthread_key_create 2025-05-07T20:10:35.9481445Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:35.9481548Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:35.9481683Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:35.9481917Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.9481936Z 2025-05-07T20:10:35.9505993Z linux-vdso.so.1 (0x00007fffddd87000) 2025-05-07T20:10:35.9506317Z libc10.so => not found 2025-05-07T20:10:35.9506607Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9506909Z libc10_cuda.so => not found 2025-05-07T20:10:35.9507231Z libnccl.so.2 => not found 2025-05-07T20:10:35.9507322Z 
libcuda.so.1 => not found 2025-05-07T20:10:35.9507813Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f8211200000) 2025-05-07T20:10:35.9508284Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f8210e00000) 2025-05-07T20:10:35.9508843Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f8210c59000) 2025-05-07T20:10:35.9508982Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9509083Z libtorch.so => not found 2025-05-07T20:10:35.9509550Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f8213592000) 2025-05-07T20:10:35.9509773Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9509871Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9510053Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f82109f5000) 2025-05-07T20:10:35.9510204Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8213564000) 2025-05-07T20:10:35.9510362Z libc.so.6 => /lib64/libc.so.6 (0x00007f82107ed000) 2025-05-07T20:10:35.9510506Z /lib64/ld-linux-x86-64.so.2 (0x00007f82135a5000) 2025-05-07T20:10:35.9510603Z libtorch.so => not found 2025-05-07T20:10:35.9510692Z libc10.so => not found 2025-05-07T20:10:35.9510790Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9510900Z libc10_cuda.so => not found 2025-05-07T20:10:35.9510993Z libnccl.so.2 => not found 2025-05-07T20:10:35.9511084Z libcuda.so.1 => not found 2025-05-07T20:10:35.9511201Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9511299Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9511399Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9511499Z libcudart.so.12 => not found 2025-05-07T20:10:35.9511638Z libm.so.6 => /lib64/libm.so.6 (0x00007f8213485000) 2025-05-07T20:10:35.9511787Z libgomp.so.1 => 
/lib64/libgomp.so.1 (0x00007f82129aa000) 2025-05-07T20:10:35.9511877Z libc10.so => not found 2025-05-07T20:10:35.9511984Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9512076Z libc10_cuda.so => not found 2025-05-07T20:10:35.9512168Z libnccl.so.2 => not found 2025-05-07T20:10:35.9512260Z libcuda.so.1 => not found 2025-05-07T20:10:35.9512661Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f8210200000) 2025-05-07T20:10:35.9512763Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9512855Z libtorch.so => not found 2025-05-07T20:10:35.9512966Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9513064Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9513157Z libcudart.so.12 => not found 2025-05-07T20:10:35.9513249Z libc10.so => not found 2025-05-07T20:10:35.9513390Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9513489Z libc10_cuda.so => not found 2025-05-07T20:10:35.9513581Z libnccl.so.2 => not found 2025-05-07T20:10:35.9513687Z libcuda.so.1 => not found 2025-05-07T20:10:35.9514140Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f820f000000) 2025-05-07T20:10:35.9514241Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9514332Z libtorch.so => not found 2025-05-07T20:10:35.9514442Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9514541Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9514634Z libcudart.so.12 => not found 2025-05-07T20:10:35.9514745Z libtorch.so => not found 2025-05-07T20:10:35.9514833Z libc10.so => not found 2025-05-07T20:10:35.9514927Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9515022Z libc10_cuda.so => not found 2025-05-07T20:10:35.9515129Z libnccl.so.2 => not found 2025-05-07T20:10:35.9515220Z libcuda.so.1 => not found 2025-05-07T20:10:35.9515321Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9515432Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9515527Z libtorch_cuda.so => not found 
2025-05-07T20:10:35.9515701Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f8213472000) 2025-05-07T20:10:35.9515791Z libc10.so => not found 2025-05-07T20:10:35.9515901Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9515994Z libc10_cuda.so => not found 2025-05-07T20:10:35.9516084Z libnccl.so.2 => not found 2025-05-07T20:10:35.9516192Z libcuda.so.1 => not found 2025-05-07T20:10:35.9516546Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f8212933000) 2025-05-07T20:10:35.9516654Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9516773Z libtorch.so => not found 2025-05-07T20:10:35.9516875Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9516979Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9517078Z libtorch.so => not found 2025-05-07T20:10:35.9517220Z libc10.so => not found 2025-05-07T20:10:35.9517324Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9517421Z libc10_cuda.so => not found 2025-05-07T20:10:35.9517540Z libnccl.so.2 => not found 2025-05-07T20:10:35.9517636Z libcuda.so.1 => not found 2025-05-07T20:10:35.9517742Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9517845Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9518000Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9518100Z libcudart.so.12 => not found 2025-05-07T20:10:35.9518200Z libtorch.so => not found 2025-05-07T20:10:35.9518317Z libc10.so => not found 2025-05-07T20:10:35.9518417Z libnvrtc.so.12 => not found 2025-05-07T20:10:35.9518519Z libc10_cuda.so => not found 2025-05-07T20:10:35.9518619Z libnccl.so.2 => not found 2025-05-07T20:10:35.9518737Z libcuda.so.1 => not found 2025-05-07T20:10:35.9518843Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:35.9518944Z libtorch_cpu.so => not found 2025-05-07T20:10:35.9519079Z libtorch_cuda.so => not found 2025-05-07T20:10:35.9519223Z librt.so.1 => /lib64/librt.so.1 (0x00007f8213463000) 2025-05-07T20:10:35.9519239Z 2025-05-07T20:10:35.9519356Z [CHECK] Displaying ELF information: 
2025-05-07T20:10:35.9519650Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:35.9519682Z 2025-05-07T20:10:35.9554111Z 2025-05-07T20:10:35.9554807Z Dynamic section at offset 0xa44058 contains 42 entries: 2025-05-07T20:10:35.9554975Z Tag Type Name/Value 2025-05-07T20:10:35.9555351Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:35.9555592Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:35.9555802Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:35.9556014Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:35.9556240Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:35.9556503Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:35.9556731Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:35.9557000Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:35.9557218Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:35.9557417Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:35.9557661Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:35.9557870Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:35.9558079Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:35.9558284Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:35.9558515Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:35.9558711Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:35.9558926Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:35.9559228Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 
2025-05-07T20:10:35.9559416Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:35.9559541Z 0x000000000000000c (INIT) 0x190000 2025-05-07T20:10:35.9559687Z 0x000000000000000d (FINI) 0x8ac368 2025-05-07T20:10:35.9559812Z 0x0000000000000019 (INIT_ARRAY) 0xa37c40 2025-05-07T20:10:35.9559950Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:10:35.9560099Z 0x000000000000001a (FINI_ARRAY) 0xa37d40 2025-05-07T20:10:35.9560227Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:35.9560355Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:35.9560525Z 0x0000000000000005 (STRTAB) 0x23008 2025-05-07T20:10:35.9560669Z 0x0000000000000006 (SYMTAB) 0x93e8 2025-05-07T20:10:35.9560825Z 0x000000000000000a (STRSZ) 1248185 (bytes) 2025-05-07T20:10:35.9560952Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:35.9561100Z 0x0000000000000003 (PLTGOT) 0xa47fe8 2025-05-07T20:10:35.9561269Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:10:35.9561386Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:35.9561531Z 0x0000000000000017 (JMPREL) 0x184d90 2025-05-07T20:10:35.9561652Z 0x0000000000000007 (RELA) 0x155f30 2025-05-07T20:10:35.9561793Z 0x0000000000000008 (RELASZ) 192096 (bytes) 2025-05-07T20:10:35.9561920Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:35.9562066Z 0x000000006ffffffe (VERNEED) 0x155e20 2025-05-07T20:10:35.9562183Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:35.9562310Z 0x000000006ffffff0 (VERSYM) 0x153bc2 2025-05-07T20:10:35.9562449Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:10:35.9562557Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:35.9562563Z 2025-05-07T20:10:35.9562691Z ################################################################################ 2025-05-07T20:10:35.9562698Z 2025-05-07T20:10:35.9562702Z 2025-05-07T20:10:35.9562848Z ################################################################################ 2025-05-07T20:10:35.9563154Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:35.9563268Z [CHECK] Listing out library size: 2025-05-07T20:10:35.9563563Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:35.9563567Z 2025-05-07T20:10:35.9569534Z 429 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:35.9569810Z 2025-05-07T20:10:35.9572113Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:35.9572629Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.9572636Z 2025-05-07T20:10:35.9968721Z GLIBC_2.2.5 2025-05-07T20:10:35.9969386Z GLIBC_2.14 2025-05-07T20:10:35.9969728Z 2025-05-07T20:10:35.9969742Z 2025-05-07T20:10:35.9971385Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:35.9974367Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:35.9976174Z 2025-05-07T20:10:36.0362186Z GLIBCXX_3.4 2025-05-07T20:10:36.0363057Z GLIBCXX_3.4.9 2025-05-07T20:10:36.0363681Z GLIBCXX_3.4.11 2025-05-07T20:10:36.0364295Z GLIBCXX_3.4.14 2025-05-07T20:10:36.0364880Z GLIBCXX_3.4.18 2025-05-07T20:10:36.0365476Z GLIBCXX_3.4.20 2025-05-07T20:10:36.0366043Z GLIBCXX_3.4.21 2025-05-07T20:10:36.0366440Z 2025-05-07T20:10:36.0366455Z 2025-05-07T20:10:36.0380379Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.1eGegTJq8A.symbols.txt 2025-05-07T20:10:36.0381764Z 2025-05-07T20:10:36.0738308Z 2025-05-07T20:10:36.0767199Z [CHECK] Total Number of symbols: 5083 2025-05-07T20:10:36.0805353Z [CHECK] Number of fbgemm symbols: 3788 
2025-05-07T20:10:36.0822959Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.EeaEv0gJju.usymbols.txt 2025-05-07T20:10:36.0824398Z 2025-05-07T20:10:36.0860071Z 2025-05-07T20:10:36.0894781Z [CHECK] Listing out undefined symbols (246 total): 2025-05-07T20:10:36.0915488Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.0917064Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.0917840Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:36.0918503Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.0918925Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.0919402Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.0919788Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:36.0920164Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:36.0920531Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:36.0920891Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.0921303Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:36.0921629Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.0921962Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.0922280Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.0922609Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:36.0922944Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.0923258Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.0923585Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.0923892Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.0924200Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:36.0924570Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:36.0925108Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:36.0925929Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.0927152Z U 
at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.0928416Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.0929432Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:36.0930123Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.0930831Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:36.0931398Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:36.0932350Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:36.0933663Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.0934346Z U at::detail::getCUDAHooks() 2025-05-07T20:10:36.0934662Z U at::detail::getHIPHooks() 2025-05-07T20:10:36.0934974Z U at::get_thread_num() 2025-05-07T20:10:36.0935260Z U at::globalContext() 2025-05-07T20:10:36.0935576Z U at::internal::set_thread_num(int) 2025-05-07T20:10:36.0935962Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:36.0936498Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.0937197Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.0937650Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:10:36.0938287Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:10:36.0938978Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:36.0939885Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, 
c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.0941006Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.0941592Z U c10::Error::what() const 2025-05-07T20:10:36.0941902Z U c10::GradMode::is_enabled() 2025-05-07T20:10:36.0942236Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:36.0942603Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.0943047Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.0943504Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:36.0943894Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:10:36.0944281Z U c10::IValue::isTensorList() const 2025-05-07T20:10:36.0944644Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.0945001Z U c10::IntType::get() 2025-05-07T20:10:36.0945663Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.0946458Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.0946870Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.0947191Z U c10::NoneType::get() 2025-05-07T20:10:36.0947615Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.0948070Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:36.0948441Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:36.0948836Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:36.0949208Z U c10::StringType::get() 2025-05-07T20:10:36.0949565Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:36.0949959Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:36.0950628Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.0951381Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.0951732Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.0952113Z U 
c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:36.0952782Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:36.0953417Z U c10::TensorType::get() 2025-05-07T20:10:36.0954369Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:36.0955329Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.0956245Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:36.0957398Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:10:36.0957816Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:36.0958196Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:36.0958521Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:36.0959022Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:36.0959369Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:36.0959699Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:36.0960165Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:36.0960609Z U c10::cuda::device_count() 2025-05-07T20:10:36.0960974Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:36.0961356Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:36.0961770Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:36.0962183Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:36.0962587Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:36.0963003Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:36.0963844Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.0964916Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.0966586Z U 
c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.0968508Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:36.0969382Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.0970508Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.0971623Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.0972479Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:10:36.0972943Z U c10::get_default_dtype() 2025-05-07T20:10:36.0973797Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:36.0974413Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:36.0974902Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:36.0975257Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:36.0975645Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:36.0976262Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.0977021Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:10:36.0977489Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:10:36.0978074Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:10:36.0978614Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:36.0978999Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:36.0979425Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:10:36.0979887Z U c10::warn(c10::Warning const&) 
2025-05-07T20:10:36.0980309Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:36.0980790Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:36.0981159Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:36.0981565Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:36.0981935Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:36.0982322Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:36.0982708Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:36.0983053Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:36.0983436Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:36.0983796Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:36.0984170Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:36.0984522Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:36.0984900Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:36.0985341Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:36.0985711Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:36.0986721Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0988483Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0990215Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0991788Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0993352Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0994968Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.0996456Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:36.0997905Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:36.0999459Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1001086Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:36.1002728Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1004356Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:36.1005935Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:36.1007620Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1009184Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:36.1010668Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, 
bool, bool, bool, int) 2025-05-07T20:10:36.1012268Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1013894Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:36.1015569Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1017454Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:36.1019200Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:36.1021524Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1023598Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1025794Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1027746Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1029665Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1031439Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1033276Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:36.1034394Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.1034789Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.1036603Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.1036961Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.1037583Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:10:36.1038226Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.1038648Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.1039021Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.1039780Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:10:36.1041270Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.1041851Z U memchr@GLIBC_2.2.5 2025-05-07T20:10:36.1042110Z U memcpy@GLIBC_2.14 2025-05-07T20:10:36.1042382Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.1042643Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.1042935Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:36.1043268Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.1043675Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:36.1044295Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.1045046Z U std::__cxx11::basic_ostringstream, std::allocator 
>::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.1045808Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:36.1046871Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:10:36.1047964Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.1048800Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:36.1049380Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:36.1049743Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:36.1050127Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.1050450Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:36.1050776Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:36.1051142Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.1051495Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.1051933Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.1052333Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:36.1052809Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.1053692Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.1054489Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:36.1054977Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:10:36.1055437Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:36.1055845Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:10:36.1056218Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:36.1056649Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.1057243Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.1057614Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.1058075Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:10:36.1058535Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.1059175Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:36.1059867Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:10:36.1060339Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.1060888Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.1061414Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.1061832Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.1062294Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:36.1062788Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:36.1063350Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.1063835Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:36.1064215Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.1064564Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:36.1064865Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.1065220Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.1066071Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.1067248Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.1068091Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.1069523Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, 
std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:10:36.1071247Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:10:36.1072113Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.1073012Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:10:36.1073670Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:10:36.1074272Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:10:36.1075259Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:10:36.1076144Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:10:36.1076781Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:10:36.1077584Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:10:36.1078336Z U typeinfo for c10::Error 2025-05-07T20:10:36.1078711Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:36.1079091Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:10:36.1079498Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:36.1079892Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:36.1080367Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.1080921Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.1081425Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.1081988Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.1082423Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.1082823Z U vtable for c10::Error 
2025-05-07T20:10:36.1083465Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.1084089Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.1084555Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:36.1084892Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:36.1085228Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.1085558Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.1085850Z w __gmon_start__ 2025-05-07T20:10:36.1086144Z w __pthread_key_create 2025-05-07T20:10:36.1086440Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:36.1086784Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:36.1087134Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:36.1087635Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:36.1087941Z 2025-05-07T20:10:36.1088048Z linux-vdso.so.1 (0x00007ffe06188000) 2025-05-07T20:10:36.1088355Z libc10.so => not found 2025-05-07T20:10:36.1088631Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.1088887Z libc10_cuda.so => not found 2025-05-07T20:10:36.1089155Z libnccl.so.2 => not found 2025-05-07T20:10:36.1089438Z libcuda.so.1 => not found 2025-05-07T20:10:36.1089955Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f6bcc600000) 2025-05-07T20:10:36.1090817Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f6bcae00000) 2025-05-07T20:10:36.1091774Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f6bccbf5000) 2025-05-07T20:10:36.1092393Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.1092656Z libtorch.so => not found 2025-05-07T20:10:36.1092925Z libtorch_cpu.so => not found 2025-05-07T20:10:36.1093185Z libtorch_cuda.so => not found 2025-05-07T20:10:36.1093468Z libcudart.so.12 => not found 
2025-05-07T20:10:36.1093788Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6bcab9c000) 2025-05-07T20:10:36.1094206Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6bccbc7000) 2025-05-07T20:10:36.1094567Z libc.so.6 => /lib64/libc.so.6 (0x00007f6bca994000) 2025-05-07T20:10:36.1094897Z libc10.so => not found 2025-05-07T20:10:36.1095161Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.1095440Z libc10_cuda.so => not found 2025-05-07T20:10:36.1095718Z libnccl.so.2 => not found 2025-05-07T20:10:36.1095966Z libcuda.so.1 => not found 2025-05-07T20:10:36.1096618Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f6bcc589000) 2025-05-07T20:10:36.1097349Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.1097744Z libtorch.so => not found 2025-05-07T20:10:36.1098015Z libtorch_cpu.so => not found 2025-05-07T20:10:36.1098361Z libtorch_cuda.so => not found 2025-05-07T20:10:36.1098695Z libm.so.6 => /lib64/libm.so.6 (0x00007f6bca8b9000) 2025-05-07T20:10:36.1099068Z /lib64/ld-linux-x86-64.so.2 (0x00007f6be7d26000) 2025-05-07T20:10:36.1099421Z libtorch.so => not found 2025-05-07T20:10:36.1099678Z libc10.so => not found 2025-05-07T20:10:36.1099956Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.1100225Z libc10_cuda.so => not found 2025-05-07T20:10:36.1100515Z libnccl.so.2 => not found 2025-05-07T20:10:36.1100781Z libcuda.so.1 => not found 2025-05-07T20:10:36.1101080Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.1101368Z libtorch_cpu.so => not found 2025-05-07T20:10:36.1101675Z libtorch_cuda.so => not found 2025-05-07T20:10:36.1101974Z libcudart.so.12 => not found 2025-05-07T20:10:36.1102312Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f6bccb6b000) 2025-05-07T20:10:36.1102699Z libtorch.so => not found 2025-05-07T20:10:36.1102954Z libc10.so => not found 2025-05-07T20:10:36.1103239Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.1103513Z libc10_cuda.so => not found 2025-05-07T20:10:36.1103803Z libnccl.so.2 => not found 
2025-05-07T20:10:36.1104073Z libcuda.so.1 => not found 2025-05-07T20:10:36.1104368Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.1104659Z libtorch_cpu.so => not found 2025-05-07T20:10:36.1104972Z libtorch_cuda.so => not found 2025-05-07T20:10:36.1105354Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f6bccb62000) 2025-05-07T20:10:36.1105751Z libtorch.so => not found 2025-05-07T20:10:36.1106036Z libc10.so => not found 2025-05-07T20:10:36.1106298Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.1106598Z libc10_cuda.so => not found 2025-05-07T20:10:36.1106871Z libnccl.so.2 => not found 2025-05-07T20:10:36.1107164Z libcuda.so.1 => not found 2025-05-07T20:10:36.1107437Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.1107754Z libtorch_cpu.so => not found 2025-05-07T20:10:36.1108067Z libtorch_cuda.so => not found 2025-05-07T20:10:36.1108412Z librt.so.1 => /lib64/librt.so.1 (0x00007f6bcc582000) 2025-05-07T20:10:36.1108656Z 2025-05-07T20:10:36.1108791Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.1109319Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:36.1109639Z 2025-05-07T20:10:36.1109672Z 2025-05-07T20:10:36.1109862Z Dynamic section at offset 0x1ac7bfc8 contains 41 entries: 2025-05-07T20:10:36.1110227Z Tag Type Name/Value 2025-05-07T20:10:36.1110647Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.1111112Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:36.1111607Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.1112100Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.1112565Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.1113055Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:36.1113541Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:36.1114067Z 0x0000000000000001 (NEEDED) 
Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:36.1114590Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.1115067Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:36.1115594Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.1116087Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.1116599Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:36.1117084Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:36.1117582Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.1118100Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.1118594Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:10:36.1119106Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:36.1119487Z 0x000000000000000c (INIT) 0x1a0000 2025-05-07T20:10:36.1119836Z 0x000000000000000d (FINI) 0x74838c 2025-05-07T20:10:36.1120163Z 0x0000000000000019 (INIT_ARRAY) 0x1ac7aca0 2025-05-07T20:10:36.1120532Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:10:36.1120893Z 0x000000000000001a (FINI_ARRAY) 0x1ac7ae28 2025-05-07T20:10:36.1121232Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.1121581Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:36.1121898Z 0x0000000000000005 (STRTAB) 0x27a50 2025-05-07T20:10:36.1122237Z 0x0000000000000006 (SYMTAB) 0x9db0 2025-05-07T20:10:36.1122577Z 0x000000000000000a (STRSZ) 1387089 (bytes) 2025-05-07T20:10:36.1122942Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.1123277Z 0x0000000000000003 (PLTGOT) 0x1ac84fe8 2025-05-07T20:10:36.1123640Z 0x0000000000000002 (PLTRELSZ) 20568 (bytes) 2025-05-07T20:10:36.1123990Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.1124300Z 0x0000000000000017 (JMPREL) 0x19af18 2025-05-07T20:10:36.1124637Z 
0x0000000000000007 (RELA) 0x17cd80 2025-05-07T20:10:36.1124975Z 0x0000000000000008 (RELASZ) 123288 (bytes) 2025-05-07T20:10:36.1125328Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.1125656Z 0x000000006ffffffe (VERNEED) 0x17cc60 2025-05-07T20:10:36.1125996Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:36.1126308Z 0x000000006ffffff0 (VERSYM) 0x17a4a2 2025-05-07T20:10:36.1126671Z 0x000000006ffffff9 (RELACOUNT) 539 2025-05-07T20:10:36.1126993Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:36.1127183Z 2025-05-07T20:10:36.1127297Z ################################################################################ 2025-05-07T20:10:36.1127532Z 2025-05-07T20:10:36.1127536Z 2025-05-07T20:10:36.1127647Z ################################################################################ 2025-05-07T20:10:36.1128209Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.1128756Z [CHECK] Listing out library size: 2025-05-07T20:10:36.1129267Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.1129677Z 2025-05-07T20:10:36.1129945Z 5 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.1130313Z 2025-05-07T20:10:36.1130742Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.1131798Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.1132404Z 2025-05-07T20:10:36.1303497Z GLIBC_2.2.5 2025-05-07T20:10:36.1304528Z GLIBC_2.3 2025-05-07T20:10:36.1305564Z GLIBC_2.14 2025-05-07T20:10:36.1306190Z 2025-05-07T20:10:36.1306211Z 2025-05-07T20:10:36.1307948Z [CHECK] Listing out the GLIBCXX versions referenced by: 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.1310343Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.1311005Z 2025-05-07T20:10:36.1571492Z GLIBCXX_3.4 2025-05-07T20:10:36.1572317Z GLIBCXX_3.4.9 2025-05-07T20:10:36.1572659Z GLIBCXX_3.4.11 2025-05-07T20:10:36.1573029Z GLIBCXX_3.4.15 2025-05-07T20:10:36.1573274Z GLIBCXX_3.4.18 2025-05-07T20:10:36.1573494Z GLIBCXX_3.4.20 2025-05-07T20:10:36.1573737Z GLIBCXX_3.4.21 2025-05-07T20:10:36.1573867Z 2025-05-07T20:10:36.1573872Z 2025-05-07T20:10:36.1590960Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.Ln6Ikkhdm1.symbols.txt 2025-05-07T20:10:36.1592641Z 2025-05-07T20:10:36.1807488Z 2025-05-07T20:10:36.1834275Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:10:36.1854491Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:10:36.1871941Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.SBbLkAf0qP.usymbols.txt 2025-05-07T20:10:36.1872534Z 2025-05-07T20:10:36.1909460Z 2025-05-07T20:10:36.1937650Z [CHECK] Listing out undefined symbols (189 total): 2025-05-07T20:10:36.1954551Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.1955833Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.1956405Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:36.1956769Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:36.1957118Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.1957467Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.1957786Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.1958144Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:36.1958477Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.1958833Z 
U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.1959164Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.1959519Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.1960028Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:36.1960356Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:36.1960708Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:36.1961035Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:36.1961468Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:36.1961964Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:36.1962339Z U at::RecordFunction::end() 2025-05-07T20:10:36.1962714Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:36.1963095Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:36.1964087Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.1965332Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:36.1966274Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.1967556Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.1968421Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:36.1968857Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:36.1969344Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.1969770Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:36.1970619Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.1971167Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:36.1971577Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 
2025-05-07T20:10:36.1972014Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:36.1972355Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:36.1972679Z U c10::AnyType::get() 2025-05-07T20:10:36.1972986Z U c10::BoolType::get() 2025-05-07T20:10:36.1973401Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:36.1973844Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:36.1974595Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:36.1975861Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:36.1977059Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.1977652Z U c10::Error::what() const 2025-05-07T20:10:36.1977998Z U c10::FloatType::get() 2025-05-07T20:10:36.1978322Z U c10::GradMode::is_enabled() 2025-05-07T20:10:36.1978679Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:36.1979092Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:36.1979491Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:36.1979858Z U c10::IValue::isBoolList() const 2025-05-07T20:10:36.1980194Z U c10::IValue::isIntList() const 2025-05-07T20:10:36.1980598Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:36.1980946Z U c10::IValue::isTensorList() const 2025-05-07T20:10:36.1981352Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.1981741Z U c10::IntType::get() 2025-05-07T20:10:36.1982421Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.1983239Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.1983651Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.1984038Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.1984426Z U 
c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.1984883Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.1985523Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:36.1986019Z U c10::StringType::get() 2025-05-07T20:10:36.1986400Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:36.1986806Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:36.1987264Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:36.1987757Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:36.1988175Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:36.1988869Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.1989541Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.1989931Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:36.1990379Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:36.1990780Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:36.1991178Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.1991648Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:36.1992040Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:36.1992431Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:36.1992781Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:36.1993131Z U c10::SymIntType::get() 2025-05-07T20:10:36.1993463Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:36.1993814Z U c10::TensorType::get() 2025-05-07T20:10:36.1994145Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.1994810Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.1995850Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.1996697Z U c10::detail::torchCheckFail(char const*, 
char const*, unsigned int, char const*) 2025-05-07T20:10:36.1997557Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.1998499Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.1999489Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.2000520Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:36.2001316Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:36.2001745Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.2002195Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:36.2002837Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.2003472Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:36.2003894Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:36.2004326Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:36.2004763Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:36.2005224Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:36.2005687Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:36.2006200Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:36.2006678Z U free@GLIBC_2.2.5 2025-05-07T20:10:36.2007081Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.2007478Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:36.2007838Z U memcpy@GLIBC_2.14 2025-05-07T20:10:36.2008141Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.2008471Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.2008823Z U operator delete(void*)@GLIBCXX_3.4 
2025-05-07T20:10:36.2009188Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.2023080Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:36.2023859Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:36.2024215Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.2024610Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.2024982Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:36.2025367Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.2025748Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:36.2025902Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.2026030Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:36.2026185Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.2026358Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.2026537Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.2026688Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:36.2026862Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:36.2027110Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.2027689Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.2027860Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:36.2028023Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.2028157Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.2028308Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.2028436Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.2028662Z U std::ostream& 
std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.2029046Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.2029177Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.2029345Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.2029509Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:36.2029908Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:36.2030056Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:36.2030194Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.2030297Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:36.2030400Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.2030529Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.2032400Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.2032842Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.2033117Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.2033270Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:36.2033559Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:36.2033765Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:36.2033963Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:36.2034153Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:36.2034515Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:36.2034665Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:36.2034855Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 
2025-05-07T20:10:36.2035056Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:36.2035187Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:36.2035305Z U torch::autograd::Node::metadata() 2025-05-07T20:10:36.2035467Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:36.2035702Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:36.2035959Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:36.2036121Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:36.2036322Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:36.2036532Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:36.2038962Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:36.2039173Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:36.2039346Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:36.2039506Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:36.2040257Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:36.2040411Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:36.2040794Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:36.2041186Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.2041294Z U typeinfo for c10::Error 2025-05-07T20:10:36.2041434Z U typeinfo for 
std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.2041583Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:36.2041718Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:36.2041874Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:36.2042017Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:36.2042167Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.2042328Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.2042483Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:36.2042662Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.2042767Z U vtable for c10::Error 2025-05-07T20:10:36.2043082Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.2043242Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.2043459Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.2043579Z U vtable for torch::autograd::Node 2025-05-07T20:10:36.2043771Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.2043887Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:36.2043996Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.2044134Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.2044226Z w __gmon_start__ 2025-05-07T20:10:36.2044326Z w __pthread_key_create 2025-05-07T20:10:36.2044467Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:36.2044582Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:36.2044724Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:36.2044999Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.2045005Z 2025-05-07T20:10:36.2045191Z linux-vdso.so.1 (0x00007ffd5f151000) 2025-05-07T20:10:36.2045285Z libc10.so => not found 2025-05-07T20:10:36.2045411Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.2045509Z libc10_cuda.so => not found 
2025-05-07T20:10:36.2045607Z libnccl.so.2 => not found 2025-05-07T20:10:36.2045702Z libcuda.so.1 => not found 2025-05-07T20:10:36.2046155Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fc6a9fcc000) 2025-05-07T20:10:36.2046608Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc6a8800000) 2025-05-07T20:10:36.2046708Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.2046830Z libtorch.so => not found 2025-05-07T20:10:36.2046927Z libtorch_cpu.so => not found 2025-05-07T20:10:36.2047030Z libtorch_cuda.so => not found 2025-05-07T20:10:36.2047189Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc6a859c000) 2025-05-07T20:10:36.2047361Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc6a9f9c000) 2025-05-07T20:10:36.2047485Z libc.so.6 => /lib64/libc.so.6 (0x00007fc6a8394000) 2025-05-07T20:10:36.2047612Z /lib64/ld-linux-x86-64.so.2 (0x00007fc6a9fdd000) 2025-05-07T20:10:36.2047728Z libtorch.so => not found 2025-05-07T20:10:36.2047819Z libc10.so => not found 2025-05-07T20:10:36.2047915Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.2048037Z libc10_cuda.so => not found 2025-05-07T20:10:36.2048131Z libnccl.so.2 => not found 2025-05-07T20:10:36.2048223Z libcuda.so.1 => not found 2025-05-07T20:10:36.2048352Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.2048471Z libtorch_cpu.so => not found 2025-05-07T20:10:36.2048570Z libtorch_cuda.so => not found 2025-05-07T20:10:36.2048718Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc6a9f42000) 2025-05-07T20:10:36.2048911Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fc6a9f3d000) 2025-05-07T20:10:36.2049006Z libtorch.so => not found 2025-05-07T20:10:36.2049099Z libc10.so => not found 2025-05-07T20:10:36.2049194Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.2049338Z libc10_cuda.so => not found 2025-05-07T20:10:36.2049437Z libnccl.so.2 => not found 
2025-05-07T20:10:36.2049533Z libcuda.so.1 => not found 2025-05-07T20:10:36.2049662Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.2049763Z libtorch_cpu.so => not found 2025-05-07T20:10:36.2049862Z libtorch_cuda.so => not found 2025-05-07T20:10:36.2049954Z libcudart.so.12 => not found 2025-05-07T20:10:36.2050101Z libm.so.6 => /lib64/libm.so.6 (0x00007fc6a9925000) 2025-05-07T20:10:36.2050106Z 2025-05-07T20:10:36.2050218Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.2050515Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:36.2050520Z 2025-05-07T20:10:36.2051437Z 2025-05-07T20:10:36.2051627Z Dynamic section at offset 0x4b5fc8 contains 40 entries: 2025-05-07T20:10:36.2051747Z Tag Type Name/Value 2025-05-07T20:10:36.2051942Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.2052190Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:36.2052384Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.2052569Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.2052780Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.2052990Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:36.2053202Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:36.2053422Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.2053614Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:36.2053805Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.2054055Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.2054248Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:36.2054432Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.2054635Z 
0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.2054880Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:36.2055163Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:10:36.2055326Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:36.2055462Z 0x000000000000000c (INIT) 0xd6000 2025-05-07T20:10:36.2055573Z 0x000000000000000d (FINI) 0x3f64b8 2025-05-07T20:10:36.2055684Z 0x0000000000000019 (INIT_ARRAY) 0x4add80 2025-05-07T20:10:36.2055832Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:10:36.2055950Z 0x000000000000001a (FINI_ARRAY) 0x4adeb0 2025-05-07T20:10:36.2056068Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.2056208Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:36.2056400Z 0x0000000000000005 (STRTAB) 0x16e00 2025-05-07T20:10:36.2056517Z 0x0000000000000006 (SYMTAB) 0x55e0 2025-05-07T20:10:36.2056652Z 0x000000000000000a (STRSZ) 609767 (bytes) 2025-05-07T20:10:36.2056971Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.2057124Z 0x0000000000000003 (PLTGOT) 0x4b8fe8 2025-05-07T20:10:36.2057259Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:10:36.2057397Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.2057517Z 0x0000000000000017 (JMPREL) 0xcdaf0 2025-05-07T20:10:36.2057635Z 0x0000000000000007 (RELA) 0xad450 2025-05-07T20:10:36.2057835Z 0x0000000000000008 (RELASZ) 132768 (bytes) 2025-05-07T20:10:36.2057992Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.2058114Z 0x000000006ffffffe (VERNEED) 0xad340 2025-05-07T20:10:36.2058233Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:36.2058373Z 0x000000006ffffff0 (VERSYM) 0xabbe8 2025-05-07T20:10:36.2058489Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:10:36.2058603Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:36.2058618Z 2025-05-07T20:10:36.2058766Z 
################################################################################ 2025-05-07T20:10:36.2058770Z 2025-05-07T20:10:36.2058774Z 2025-05-07T20:10:36.2058896Z ################################################################################ 2025-05-07T20:10:36.2059208Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.2059347Z [CHECK] Listing out library size: 2025-05-07T20:10:36.2059645Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.2059650Z 2025-05-07T20:10:36.2066476Z 339 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.2066671Z 2025-05-07T20:10:36.2068046Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.2068623Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.2068629Z 2025-05-07T20:10:36.3042489Z GLIBC_2.2.5 2025-05-07T20:10:36.3042662Z GLIBC_2.3 2025-05-07T20:10:36.3042774Z GLIBC_2.14 2025-05-07T20:10:36.3043291Z 2025-05-07T20:10:36.3043354Z 2025-05-07T20:10:36.3043869Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.3044626Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.3044631Z 2025-05-07T20:10:36.4005903Z GLIBCXX_3.4 2025-05-07T20:10:36.4006111Z GLIBCXX_3.4.9 2025-05-07T20:10:36.4006226Z GLIBCXX_3.4.20 2025-05-07T20:10:36.4006338Z GLIBCXX_3.4.21 2025-05-07T20:10:36.4006345Z 2025-05-07T20:10:36.4006583Z 2025-05-07T20:10:36.4039630Z + nm -gDC 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.PR4u12NAUG.symbols.txt 2025-05-07T20:10:36.4039676Z 2025-05-07T20:10:36.5010129Z 2025-05-07T20:10:36.5074058Z [CHECK] Total Number of symbols: 12626 2025-05-07T20:10:36.5138987Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:10:36.5165534Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.O80XcYOGvx.usymbols.txt 2025-05-07T20:10:36.5166089Z 2025-05-07T20:10:36.5215049Z 2025-05-07T20:10:36.5243999Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:36.5260253Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.5262022Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:36.5263018Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.5264165Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.5265263Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.5265808Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:36.5266185Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:36.5266553Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:36.5266928Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.5267280Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:36.5267615Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.5267995Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.5268327Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.5268642Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:36.5268987Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.5269302Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.5269641Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.5269966Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.5270477Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:36.5270801Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:36.5271115Z U __tls_get_addr@GLIBC_2.3 
2025-05-07T20:10:36.5271499Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:36.5271904Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:36.5272450Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:36.5273154Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:36.5273770Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:36.5274380Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:36.5275414Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.5276361Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:36.5276863Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.5277413Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:36.5277875Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:36.5278336Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.5278817Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5279287Z U c10::BoolType::get() 2025-05-07T20:10:36.5279638Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:36.5280100Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:36.5280517Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:36.5281241Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:36.5282559Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:36.5283574Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 
2025-05-07T20:10:36.5284103Z U c10::Error::what() const 2025-05-07T20:10:36.5284398Z U c10::FloatType::get() 2025-05-07T20:10:36.5284727Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.5286628Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5287044Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.5287366Z U c10::IntType::get() 2025-05-07T20:10:36.5287721Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.5288090Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.5288471Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.5288799Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.5289159Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:36.5289540Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:36.5289907Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:36.5290529Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.5291123Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.5291483Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:10:36.5291842Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:36.5292180Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.5292523Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:36.5292862Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:10:36.5293211Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:36.5293554Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:36.5293886Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:36.5294217Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:36.5294519Z U c10::SymIntType::get() 2025-05-07T20:10:36.5294821Z U c10::TensorType::get() 2025-05-07T20:10:36.5295118Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.5295994Z U c10::Warning::Warning(std::variant, 
c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:36.5297187Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:36.5297545Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:36.5297908Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:36.5298249Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:36.5298604Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:36.5298991Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:36.5299453Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:36.5299923Z U c10::cuda::device_count() 2025-05-07T20:10:36.5300263Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:36.5300650Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:36.5301049Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:36.5301439Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:36.5301855Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:36.5302233Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:36.5302971Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.5303873Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:36.5304716Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.5305652Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.5306698Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.5307480Z U c10::get_default_dtype() 2025-05-07T20:10:36.5307817Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:36.5308150Z U c10::impl::GPUTrace::haveState 
2025-05-07T20:10:36.5308690Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:36.5309408Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:36.5309789Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:36.5310115Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.5310461Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:36.5310823Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:10:36.5311153Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:10:36.5311491Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:36.5311828Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:36.5312166Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:36.5312709Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:36.5313098Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:36.5313510Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:36.5314041Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:36.5314443Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:36.5314884Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:36.5315243Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:36.5315641Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:36.5316003Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:36.5316355Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:36.5316700Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:36.5317046Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:36.5317464Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:36.5317802Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:36.5318156Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:36.5318494Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:36.5318851Z U cudaStreamSynchronize@libcudart.so.12 
2025-05-07T20:10:36.5319199Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:36.5319729Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.5320255Z U float at::Tensor::item() const 2025-05-07T20:10:36.5320618Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.5321032Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5321386Z U free@GLIBC_2.2.5 2025-05-07T20:10:36.5321699Z U int at::Tensor::item() const 2025-05-07T20:10:36.5322061Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.5322463Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5322902Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.5323318Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.5323718Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5324069Z U memcpy@GLIBC_2.14 2025-05-07T20:10:36.5324364Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.5324693Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.5324995Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:36.5325346Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.5325909Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.5326741Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.5327355Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.5327716Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.5328114Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.5328530Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.5329170Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.5330133Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5330860Z U 
std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:36.5331206Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.5331528Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.5331858Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.5332181Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.5332553Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5333056Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5333532Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.5333864Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.5334147Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.5334449Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.5335205Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.5336282Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.5337317Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.5338049Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.5338623Z U typeinfo for c10::Error 2025-05-07T20:10:36.5338993Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.5339411Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.5339849Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.5340232Z U vtable for c10::Error 2025-05-07T20:10:36.5340791Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.5341471Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.5342000Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.5342442Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:36.5342786Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.5343094Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.5343432Z w __gmon_start__ 2025-05-07T20:10:36.5343704Z w __pthread_key_create 2025-05-07T20:10:36.5344057Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:36.5344538Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.5344899Z 2025-05-07T20:10:36.5345043Z linux-vdso.so.1 (0x00007ffd093af000) 2025-05-07T20:10:36.5345347Z libc10.so => not found 2025-05-07T20:10:36.5345597Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5345875Z libc10_cuda.so => not found 2025-05-07T20:10:36.5346130Z libnccl.so.2 => not found 2025-05-07T20:10:36.5346396Z libcuda.so.1 => not found 2025-05-07T20:10:36.5347022Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f02d7a00000) 2025-05-07T20:10:36.5347701Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5347986Z libtorch.so => not found 2025-05-07T20:10:36.5348254Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5348534Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5348795Z libcudart.so.12 => not found 2025-05-07T20:10:36.5349246Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f02d779c000) 2025-05-07T20:10:36.5349635Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f02edac2000) 2025-05-07T20:10:36.5350003Z libc.so.6 => /lib64/libc.so.6 (0x00007f02d7594000) 2025-05-07T20:10:36.5350332Z /lib64/ld-linux-x86-64.so.2 (0x00007f02edaf8000) 2025-05-07T20:10:36.5350639Z libc10.so => not found 2025-05-07T20:10:36.5350864Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5351123Z libc10_cuda.so => not found 2025-05-07T20:10:36.5351373Z libnccl.so.2 => not found 2025-05-07T20:10:36.5351604Z libcuda.so.1 => not found 2025-05-07T20:10:36.5352092Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f02d7000000) 
2025-05-07T20:10:36.5352969Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f02edab5000) 2025-05-07T20:10:36.5353595Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5353857Z libtorch.so => not found 2025-05-07T20:10:36.5354133Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5354423Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5354687Z libcudart.so.12 => not found 2025-05-07T20:10:36.5355051Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f02eda5d000) 2025-05-07T20:10:36.5355415Z libm.so.6 => /lib64/libm.so.6 (0x00007f02ed982000) 2025-05-07T20:10:36.5355754Z libc10.so => not found 2025-05-07T20:10:36.5355992Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5356266Z libc10_cuda.so => not found 2025-05-07T20:10:36.5356520Z libnccl.so.2 => not found 2025-05-07T20:10:36.5356793Z libcuda.so.1 => not found 2025-05-07T20:10:36.5357281Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f02ed907000) 2025-05-07T20:10:36.5357840Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5358137Z libtorch.so => not found 2025-05-07T20:10:36.5358390Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5358679Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5358933Z libtorch.so => not found 2025-05-07T20:10:36.5359198Z libc10.so => not found 2025-05-07T20:10:36.5359435Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5359716Z libc10_cuda.so => not found 2025-05-07T20:10:36.5359970Z libnccl.so.2 => not found 2025-05-07T20:10:36.5360238Z libcuda.so.1 => not found 2025-05-07T20:10:36.5360518Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5360811Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5361096Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5361430Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f02d7dfb000) 2025-05-07T20:10:36.5361816Z libtorch.so => not found 2025-05-07T20:10:36.5362056Z libc10.so => not found 
2025-05-07T20:10:36.5362312Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5362573Z libc10_cuda.so => not found 2025-05-07T20:10:36.5362882Z libnccl.so.2 => not found 2025-05-07T20:10:36.5363129Z libcuda.so.1 => not found 2025-05-07T20:10:36.5363409Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5363673Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5363963Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5364287Z librt.so.1 => /lib64/librt.so.1 (0x00007f02d7df6000) 2025-05-07T20:10:36.5364515Z 2025-05-07T20:10:36.5364624Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.5365084Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:36.5365433Z 2025-05-07T20:10:36.5371470Z 2025-05-07T20:10:36.5372544Z Dynamic section at offset 0x15292018 contains 40 entries: 2025-05-07T20:10:36.5373712Z Tag Type Name/Value 2025-05-07T20:10:36.5374954Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.5376705Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:36.5377254Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.5377792Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.5378328Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.5378878Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:36.5379457Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.5379982Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:36.5380514Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.5381031Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.5381574Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:36.5382094Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 
2025-05-07T20:10:36.5382803Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.5383329Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.5383842Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:36.5384449Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:10:36.5385047Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:36.5385484Z 0x000000000000000c (INIT) 0x453000 2025-05-07T20:10:36.5385842Z 0x000000000000000d (FINI) 0x1fe941c 2025-05-07T20:10:36.5386224Z 0x0000000000000019 (INIT_ARRAY) 0x152889a8 2025-05-07T20:10:36.5386627Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:10:36.5386997Z 0x000000000000001a (FINI_ARRAY) 0x15288c98 2025-05-07T20:10:36.5387384Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.5387731Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:36.5388075Z 0x0000000000000005 (STRTAB) 0x624b8 2025-05-07T20:10:36.5388405Z 0x0000000000000006 (SYMTAB) 0x184f0 2025-05-07T20:10:36.5388785Z 0x000000000000000a (STRSZ) 3694099 (bytes) 2025-05-07T20:10:36.5389149Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.5389519Z 0x0000000000000003 (PLTGOT) 0x152a8fe8 2025-05-07T20:10:36.5389900Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:10:36.5390290Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.5390639Z 0x0000000000000017 (JMPREL) 0x44ece0 2025-05-07T20:10:36.5390974Z 0x0000000000000007 (RELA) 0x3ee668 2025-05-07T20:10:36.5391348Z 0x0000000000000008 (RELASZ) 394872 (bytes) 2025-05-07T20:10:36.5391711Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.5392086Z 0x000000006ffffffe (VERNEED) 0x3ee578 2025-05-07T20:10:36.5392464Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:36.5392813Z 0x000000006ffffff0 (VERSYM) 0x3e82cc 2025-05-07T20:10:36.5393171Z 0x000000006ffffff9 (RELACOUNT) 1976 2025-05-07T20:10:36.5393489Z 0x0000000000000000 
(NULL) 0x0 2025-05-07T20:10:36.5393695Z 2025-05-07T20:10:36.5393837Z ################################################################################ 2025-05-07T20:10:36.5394067Z 2025-05-07T20:10:36.5394071Z 2025-05-07T20:10:36.5394191Z ################################################################################ 2025-05-07T20:10:36.5394745Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5395287Z [CHECK] Listing out library size: 2025-05-07T20:10:36.5395775Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5396180Z 2025-05-07T20:10:36.5396439Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5396891Z 2025-05-07T20:10:36.5397300Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5398318Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.5398917Z 2025-05-07T20:10:36.5446002Z GLIBC_2.2.5 2025-05-07T20:10:36.5446547Z GLIBC_2.3 2025-05-07T20:10:36.5446788Z GLIBC_2.14 2025-05-07T20:10:36.5446909Z 2025-05-07T20:10:36.5446914Z 2025-05-07T20:10:36.5447381Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5448457Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.5449314Z 2025-05-07T20:10:36.5508615Z GLIBCXX_3.4 2025-05-07T20:10:36.5509278Z GLIBCXX_3.4.9 2025-05-07T20:10:36.5509881Z GLIBCXX_3.4.18 2025-05-07T20:10:36.5510450Z GLIBCXX_3.4.20 2025-05-07T20:10:36.5511028Z GLIBCXX_3.4.21 
2025-05-07T20:10:36.5511381Z 2025-05-07T20:10:36.5511395Z 2025-05-07T20:10:36.5530812Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.cOcqT7i2pB.symbols.txt 2025-05-07T20:10:36.5532634Z 2025-05-07T20:10:36.5558221Z 2025-05-07T20:10:36.5587011Z [CHECK] Total Number of symbols: 357 2025-05-07T20:10:36.5608168Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:10:36.5627421Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.e0rmKM2zVz.usymbols.txt 2025-05-07T20:10:36.5627952Z 2025-05-07T20:10:36.5651698Z 2025-05-07T20:10:36.5679878Z [CHECK] Listing out undefined symbols (118 total): 2025-05-07T20:10:36.5700041Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.5702434Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.5703369Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:36.5703723Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.5704135Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.5704651Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.5705051Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:36.5705428Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:36.5705796Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:36.5706171Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.5706515Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.5706894Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.5707212Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.5707542Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.5707860Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.5708197Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.5708529Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.5708843Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:36.5709182Z U __gxx_personality_v0@CXXABI_1.3 
2025-05-07T20:10:36.5709493Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:36.5710287Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.5711601Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.5712599Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:36.5713036Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.5713514Z U c10::IntType::get() 2025-05-07T20:10:36.5713867Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.5714277Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.5714826Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.5715514Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.5716115Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.5717522Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.5717842Z U c10::TensorType::get() 2025-05-07T20:10:36.5718144Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.5719009Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:36.5719932Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:36.5720268Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:36.5720600Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:36.5720915Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:36.5721242Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:36.5721578Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:36.5722008Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:36.5722451Z U c10::cuda::device_count() 2025-05-07T20:10:36.5722768Z U c10::cuda::getCurrentCUDAStream(signed char) 
2025-05-07T20:10:36.5723137Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:36.5723498Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:36.5723883Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:36.5724312Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:36.5724667Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:36.5725356Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.5726151Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:36.5726973Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.5727843Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.5728776Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.5729528Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:36.5729850Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:36.5730164Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:36.5730523Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:36.5730891Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:36.5731234Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:36.5731627Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:36.5732030Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:36.5732390Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:36.5732731Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:36.5733077Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:36.5733395Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:36.5733720Z U cudaEventRecord@libcudart.so.12 
2025-05-07T20:10:36.5734059Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:36.5734403Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:36.5734762Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:36.5735106Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:36.5735439Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:36.5735752Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:36.5736087Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:36.5736505Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:36.5737146Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5737598Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.5738128Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5738498Z U memcpy@GLIBC_2.14 2025-05-07T20:10:36.5738784Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.5739090Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.5739393Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:36.5739745Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.5740320Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.5741138Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.5741958Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:36.5742781Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:36.5743372Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.5743720Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:36.5744080Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.5744477Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.5744925Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.5745445Z U std::basic_ios 
>::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.5746362Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5747164Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.5747513Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.5747872Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.5748208Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.5748619Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5749146Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.5749636Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.5749990Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.5750297Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.5750623Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.5751428Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.5752572Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.5753392Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.5754110Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.5754823Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.5755308Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.5755723Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.5756188Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.5756781Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.5757442Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.5757896Z w 
_ITM_deregisterTMCloneTable 2025-05-07T20:10:36.5758215Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.5758537Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.5758832Z w __gmon_start__ 2025-05-07T20:10:36.5759227Z w __pthread_key_create 2025-05-07T20:10:36.5759557Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:36.5760042Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5760491Z 2025-05-07T20:10:36.5760626Z linux-vdso.so.1 (0x00007ffd79dc7000) 2025-05-07T20:10:36.5760895Z libtorch.so => not found 2025-05-07T20:10:36.5761138Z libc10.so => not found 2025-05-07T20:10:36.5761386Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.5761791Z libc10_cuda.so => not found 2025-05-07T20:10:36.5762028Z libnccl.so.2 => not found 2025-05-07T20:10:36.5762273Z libcuda.so.1 => not found 2025-05-07T20:10:36.5762512Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.5762774Z libtorch_cpu.so => not found 2025-05-07T20:10:36.5763033Z libtorch_cuda.so => not found 2025-05-07T20:10:36.5763275Z libcudart.so.12 => not found 2025-05-07T20:10:36.5763616Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc09c9dc000) 2025-05-07T20:10:36.5763998Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc09c986000) 2025-05-07T20:10:36.5764378Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc09c958000) 2025-05-07T20:10:36.5764722Z libc.so.6 => /lib64/libc.so.6 (0x00007fc09c750000) 2025-05-07T20:10:36.5765060Z /lib64/ld-linux-x86-64.so.2 (0x00007fc09ccbb000) 2025-05-07T20:10:36.5765385Z libm.so.6 => /lib64/libm.so.6 (0x00007fc09c675000) 2025-05-07T20:10:36.5765609Z 2025-05-07T20:10:36.5765709Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.5766147Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:36.5766492Z 2025-05-07T20:10:36.5782365Z 2025-05-07T20:10:36.5782544Z Dynamic section at offset 0x71b10 contains 39 entries: 2025-05-07T20:10:36.5782981Z Tag Type 
Name/Value 2025-05-07T20:10:36.5783397Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:36.5783907Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.5784422Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:36.5784928Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.5785440Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.5785933Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.5786450Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.5786972Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.5787476Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.5788011Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:36.5788513Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:36.5789111Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:36.5789638Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.5790130Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.5790658Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:36.5791278Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:36.5791776Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:36.5792105Z 0x000000000000000d (FINI) 0x316ac 2025-05-07T20:10:36.5792449Z 0x0000000000000019 (INIT_ARRAY) 0x71130 2025-05-07T20:10:36.5792806Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:10:36.5793151Z 0x000000000000001a (FINI_ARRAY) 0x71158 2025-05-07T20:10:36.5793508Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.5793856Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:36.5794200Z 0x0000000000000005 (STRTAB) 
0x2ba8 2025-05-07T20:10:36.5794524Z 0x0000000000000006 (SYMTAB) 0xa18 2025-05-07T20:10:36.5794886Z 0x000000000000000a (STRSZ) 36158 (bytes) 2025-05-07T20:10:36.5795244Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.5795604Z 0x0000000000000003 (PLTGOT) 0x71fe8 2025-05-07T20:10:36.5795969Z 0x0000000000000002 (PLTRELSZ) 5520 (bytes) 2025-05-07T20:10:36.5796348Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.5796689Z 0x0000000000000017 (JMPREL) 0xdfa8 2025-05-07T20:10:36.5797014Z 0x0000000000000007 (RELA) 0xbcc8 2025-05-07T20:10:36.5797550Z 0x0000000000000008 (RELASZ) 8928 (bytes) 2025-05-07T20:10:36.5797891Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.5798412Z 0x000000006ffffffe (VERNEED) 0xbbb8 2025-05-07T20:10:36.5798796Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:36.5799175Z 0x000000006ffffff0 (VERSYM) 0xb8e6 2025-05-07T20:10:36.5799511Z 0x000000006ffffff9 (RELACOUNT) 162 2025-05-07T20:10:36.5799813Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:36.5800010Z 2025-05-07T20:10:36.5800138Z ################################################################################ 2025-05-07T20:10:36.5800362Z 2025-05-07T20:10:36.5800366Z 2025-05-07T20:10:36.5800483Z ################################################################################ 2025-05-07T20:10:36.5801042Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.5801545Z [CHECK] Listing out library size: 2025-05-07T20:10:36.5802001Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.5802378Z 2025-05-07T20:10:36.5802606Z 35 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.5802920Z 2025-05-07T20:10:36.5803313Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.5804308Z + objdump -TC 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.5804899Z 2025-05-07T20:10:36.5926831Z GLIBC_2.2.5 2025-05-07T20:10:36.5927176Z GLIBC_2.3 2025-05-07T20:10:36.5927496Z GLIBC_2.14 2025-05-07T20:10:36.5930382Z 2025-05-07T20:10:36.5930398Z 2025-05-07T20:10:36.5931093Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.5932160Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.5932769Z 2025-05-07T20:10:36.6052096Z GLIBCXX_3.4 2025-05-07T20:10:36.6052752Z GLIBCXX_3.4.9 2025-05-07T20:10:36.6053376Z GLIBCXX_3.4.11 2025-05-07T20:10:36.6053966Z GLIBCXX_3.4.15 2025-05-07T20:10:36.6054543Z GLIBCXX_3.4.18 2025-05-07T20:10:36.6054766Z GLIBCXX_3.4.20 2025-05-07T20:10:36.6054989Z GLIBCXX_3.4.21 2025-05-07T20:10:36.6055733Z 2025-05-07T20:10:36.6055746Z 2025-05-07T20:10:36.6075905Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.PofFbR5M9M.symbols.txt 2025-05-07T20:10:36.6076046Z 2025-05-07T20:10:36.6167919Z 2025-05-07T20:10:36.6197995Z [CHECK] Total Number of symbols: 1545 2025-05-07T20:10:36.6213612Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:10:36.6229244Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.wUafQ0Wp1w.usymbols.txt 2025-05-07T20:10:36.6229284Z 2025-05-07T20:10:36.6251482Z 2025-05-07T20:10:36.6283601Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:10:36.6299991Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.6300349Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.6300505Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:10:36.6300738Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.6300992Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.6301190Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.6301575Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:36.6301772Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:36.6301953Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:36.6302170Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.6302340Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:36.6302627Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.6302835Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.6303010Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.6303259Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:36.6303467Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.6303677Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.6303822Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.6303939Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:36.6304049Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.6304162Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:36.6304284Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:36.6304397Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:36.6304504Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:36.6304653Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:36.6304811Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:36.6304995Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:36.6305152Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:36.6305289Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:36.6305445Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:36.6305647Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:36.6305789Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:36.6305920Z U 
at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:36.6306082Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:10:36.6306267Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:36.6306914Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.6307580Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.6307816Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.6307995Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.6308208Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:36.6308381Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.6308683Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.6308918Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:36.6309051Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:36.6309230Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.6309467Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:10:36.6309708Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:36.6309956Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:36.6310289Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:36.6310946Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:36.6311134Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 
2025-05-07T20:10:36.6311322Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.6311801Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.6312402Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.6312545Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:36.6312679Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:36.6312865Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:36.6312975Z U at::globalContext() 2025-05-07T20:10:36.6313125Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:10:36.6313259Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:36.6313492Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:36.6313615Z U bool at::Tensor::item() const 2025-05-07T20:10:36.6313718Z U c10::AnyType::get() 2025-05-07T20:10:36.6313912Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:36.6314118Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.6314221Z U c10::BoolType::get() 2025-05-07T20:10:36.6314411Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:36.6314600Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:36.6314747Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:36.6315362Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:36.6315945Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:36.6316322Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.6316450Z U c10::Error::what() const 2025-05-07T20:10:36.6316562Z U c10::GradMode::is_enabled() 2025-05-07T20:10:36.6316674Z U 
c10::GradMode::set_enabled(bool) 2025-05-07T20:10:36.6316859Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.6317014Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:36.6317132Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:36.6317261Z U c10::IValue::isBoolList() const 2025-05-07T20:10:36.6317369Z U c10::IValue::isIntList() const 2025-05-07T20:10:36.6317486Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:36.6317619Z U c10::IValue::isTensorList() const 2025-05-07T20:10:36.6317805Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.6317955Z U c10::IntType::get() 2025-05-07T20:10:36.6318398Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.6318568Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.6318717Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.6318869Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.6318998Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.6319277Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:36.6319442Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:36.6319554Z U c10::StringType::get() 2025-05-07T20:10:36.6319728Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:36.6320101Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.6320240Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.6320390Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.6320501Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:36.6320609Z U c10::SymIntType::get() 2025-05-07T20:10:36.6320793Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:36.6320916Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:36.6321325Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 
2025-05-07T20:10:36.6321507Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:36.6321614Z U c10::TensorType::get() 2025-05-07T20:10:36.6321798Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:36.6321925Z U c10::Type::is_module() const 2025-05-07T20:10:36.6322055Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.6322712Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:36.6322895Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:36.6323014Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:36.6323142Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:36.6323296Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:36.6323420Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:36.6323543Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:36.6323806Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:36.6323918Z U c10::cuda::device_count() 2025-05-07T20:10:36.6324057Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:36.6324191Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:36.6324360Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:36.6324503Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:36.6324663Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:36.6324805Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:36.6325209Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.6325708Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.6325977Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:36.6326454Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, 
std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.6326801Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.6327339Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.6327605Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:36.6327895Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:36.6328084Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:36.6328207Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:36.6328346Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:36.6328646Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:36.6328828Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:36.6329008Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:36.6329175Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:36.6329303Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:36.6329444Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.6329594Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:36.6329944Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.6330115Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:36.6330251Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:36.6330451Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:36.6330591Z U c10::throwNullDataPtrError() 2025-05-07T20:10:36.6330705Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:36.6330815Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:36.6330976Z U 
c10::warnDeprecatedDataPtr() 2025-05-07T20:10:36.6331164Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:36.6331285Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:36.6331411Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:36.6331561Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:36.6331685Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:36.6331800Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:36.6331950Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:36.6332065Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:36.6332176Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:36.6332327Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:36.6332449Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:36.6332589Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:36.6332713Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:36.6332870Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:36.6332985Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:36.6333098Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:36.6333249Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:36.6333374Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:36.6333565Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:36.6333796Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.6333896Z U free@GLIBC_2.2.5 2025-05-07T20:10:36.6334040Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.6334132Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:36.6334275Z U long at::Tensor::item() const 2025-05-07T20:10:36.6334448Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.6334586Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.6334753Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.6334856Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:36.6334952Z U 
memcpy@GLIBC_2.14 2025-05-07T20:10:36.6335076Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.6335175Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.6335295Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:36.6335437Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.6335536Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:36.6335743Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:36.6336066Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.6336544Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.6337026Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:36.6337428Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:36.6337652Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.6337782Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:36.6337964Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.6338115Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.6338332Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.6338503Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:36.6338658Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:36.6338907Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.6339507Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.6339650Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:36.6339782Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.6339936Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.6340064Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.6340189Z U 
std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.6340403Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.6340671Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.6340810Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.6340982Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.6341146Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:36.6341363Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.6341817Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:36.6341964Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:36.6342083Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.6342189Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:36.6342317Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.6342449Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.6343037Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.6343517Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.6343785Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.6343944Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:36.6344242Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:36.6344435Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:36.6344672Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:36.6344867Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:36.6345216Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:10:36.6345398Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:36.6345624Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:36.6345811Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:36.6345968Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:36.6346092Z U torch::autograd::Node::metadata() 2025-05-07T20:10:36.6346266Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:36.6346523Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:36.6346818Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:36.6346970Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:36.6347188Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:36.6347436Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:36.6353751Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:36.6353985Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:36.6354146Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:36.6354309Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:36.6354484Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:36.6354866Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:36.6355217Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.6355754Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, 
std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:36.6355907Z U typeinfo for c10::Error 2025-05-07T20:10:36.6356081Z U typeinfo for c10::Type 2025-05-07T20:10:36.6356227Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.6356380Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:36.6356515Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:36.6356638Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:36.6356816Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.6356978Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.6357138Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:36.6357316Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.6357476Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.6357580Z U vtable for c10::Error 2025-05-07T20:10:36.6357709Z U vtable for c10::ListType 2025-05-07T20:10:36.6358069Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.6358212Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.6358458Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.6358592Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:36.6358745Z U vtable for torch::autograd::Node 2025-05-07T20:10:36.6358946Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.6359058Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:36.6359166Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.6359292Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.6359387Z w __gmon_start__ 2025-05-07T20:10:36.6359489Z w __pthread_key_create 2025-05-07T20:10:36.6359602Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:36.6359744Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:36.6359891Z [CHECK] Listing out external shared libraries linked: 
2025-05-07T20:10:36.6360104Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.6360109Z 2025-05-07T20:10:36.6360266Z linux-vdso.so.1 (0x00007ffd192bd000) 2025-05-07T20:10:36.6360370Z libc10.so => not found 2025-05-07T20:10:36.6360471Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.6360594Z libc10_cuda.so => not found 2025-05-07T20:10:36.6360733Z libnccl.so.2 => not found 2025-05-07T20:10:36.6360825Z libcuda.so.1 => not found 2025-05-07T20:10:36.6361363Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f8275c59000) 2025-05-07T20:10:36.6361842Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f8274a00000) 2025-05-07T20:10:36.6361943Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.6362034Z libtorch.so => not found 2025-05-07T20:10:36.6362153Z libtorch_cpu.so => not found 2025-05-07T20:10:36.6362249Z libtorch_cuda.so => not found 2025-05-07T20:10:36.6362368Z libcudart.so.12 => not found 2025-05-07T20:10:36.6362518Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f827479c000) 2025-05-07T20:10:36.6362635Z libm.so.6 => /lib64/libm.so.6 (0x00007f8275b7e000) 2025-05-07T20:10:36.6362774Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f82781bd000) 2025-05-07T20:10:36.6362904Z libc.so.6 => /lib64/libc.so.6 (0x00007f8274594000) 2025-05-07T20:10:36.6363022Z /lib64/ld-linux-x86-64.so.2 (0x00007f82781f3000) 2025-05-07T20:10:36.6363106Z libc10.so => not found 2025-05-07T20:10:36.6363210Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.6363296Z libc10_cuda.so => not found 2025-05-07T20:10:36.6363383Z libnccl.so.2 => not found 2025-05-07T20:10:36.6363469Z libcuda.so.1 => not found 2025-05-07T20:10:36.6363574Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.6363654Z libtorch.so => not found 2025-05-07T20:10:36.6363743Z libtorch_cpu.so => not found 
2025-05-07T20:10:36.6363843Z libtorch_cuda.so => not found 2025-05-07T20:10:36.6363930Z libcudart.so.12 => not found 2025-05-07T20:10:36.6364014Z libtorch.so => not found 2025-05-07T20:10:36.6364093Z libc10.so => not found 2025-05-07T20:10:36.6364193Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.6364279Z libc10_cuda.so => not found 2025-05-07T20:10:36.6364365Z libnccl.so.2 => not found 2025-05-07T20:10:36.6364461Z libcuda.so.1 => not found 2025-05-07T20:10:36.6364552Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.6364639Z libtorch_cpu.so => not found 2025-05-07T20:10:36.6364729Z libtorch_cuda.so => not found 2025-05-07T20:10:36.6364829Z libcudart.so.12 => not found 2025-05-07T20:10:36.6364967Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f8275b28000) 2025-05-07T20:10:36.6364979Z 2025-05-07T20:10:36.6365107Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.6365352Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:36.6365357Z 2025-05-07T20:10:36.6399883Z 2025-05-07T20:10:36.6400595Z Dynamic section at offset 0x220d958 contains 42 entries: 2025-05-07T20:10:36.6401043Z Tag Type Name/Value 2025-05-07T20:10:36.6401890Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.6402478Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:36.6403047Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.6403634Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.6404186Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.6404905Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:36.6405556Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:36.6406149Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.6406708Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 
2025-05-07T20:10:36.6407302Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.6407817Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.6408052Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:36.6408248Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:36.6408422Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:36.6408599Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.6408845Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.6409055Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:36.6409276Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:36.6409439Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:36.6409580Z 0x000000000000000c (INIT) 0x56000 2025-05-07T20:10:36.6409688Z 0x000000000000000d (FINI) 0x1515ac 2025-05-07T20:10:36.6409797Z 0x0000000000000019 (INIT_ARRAY) 0x220b430 2025-05-07T20:10:36.6409926Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:10:36.6410036Z 0x000000000000001a (FINI_ARRAY) 0x220b4c0 2025-05-07T20:10:36.6410147Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.6410267Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:36.6410373Z 0x0000000000000005 (STRTAB) 0xbb50 2025-05-07T20:10:36.6410482Z 0x0000000000000006 (SYMTAB) 0x2a60 2025-05-07T20:10:36.6410616Z 0x000000000000000a (STRSZ) 242227 (bytes) 2025-05-07T20:10:36.6410744Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.6410852Z 0x0000000000000003 (PLTGOT) 0x220efe8 2025-05-07T20:10:36.6410975Z 0x0000000000000002 (PLTRELSZ) 16872 (bytes) 2025-05-07T20:10:36.6411086Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.6411190Z 0x0000000000000017 (JMPREL) 0x512d8 2025-05-07T20:10:36.6411292Z 0x0000000000000007 (RELA) 0x47af8 
2025-05-07T20:10:36.6411424Z 0x0000000000000008 (RELASZ) 38880 (bytes) 2025-05-07T20:10:36.6411533Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.6411638Z 0x000000006ffffffe (VERNEED) 0x47998 2025-05-07T20:10:36.6411737Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:36.6411855Z 0x000000006ffffff0 (VERSYM) 0x46d84 2025-05-07T20:10:36.6411955Z 0x000000006ffffff9 (RELACOUNT) 571 2025-05-07T20:10:36.6412090Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:36.6412094Z 2025-05-07T20:10:36.6412223Z ################################################################################ 2025-05-07T20:10:36.6412227Z 2025-05-07T20:10:36.6412231Z 2025-05-07T20:10:36.6412338Z ################################################################################ 2025-05-07T20:10:36.6412578Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.6412689Z [CHECK] Listing out library size: 2025-05-07T20:10:36.6412901Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.6412905Z 2025-05-07T20:10:36.6418309Z 73 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.6421501Z 2025-05-07T20:10:36.6423166Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.6424461Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.6424484Z 2025-05-07T20:10:36.6822029Z GLIBC_2.2.5 2025-05-07T20:10:36.6822286Z GLIBC_2.3 2025-05-07T20:10:36.6822505Z GLIBC_2.14 2025-05-07T20:10:36.6822807Z 2025-05-07T20:10:36.6822929Z 2025-05-07T20:10:36.6824283Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.6825974Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 
's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.6825994Z 2025-05-07T20:10:36.7218137Z GLIBCXX_3.4 2025-05-07T20:10:36.7218246Z GLIBCXX_3.4.9 2025-05-07T20:10:36.7218333Z GLIBCXX_3.4.11 2025-05-07T20:10:36.7218416Z GLIBCXX_3.4.14 2025-05-07T20:10:36.7218525Z GLIBCXX_3.4.15 2025-05-07T20:10:36.7218762Z GLIBCXX_3.4.18 2025-05-07T20:10:36.7218849Z GLIBCXX_3.4.19 2025-05-07T20:10:36.7218932Z GLIBCXX_3.4.20 2025-05-07T20:10:36.7219039Z GLIBCXX_3.4.21 2025-05-07T20:10:36.7219045Z 2025-05-07T20:10:36.7219050Z 2025-05-07T20:10:36.7239823Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.Gu7BeSclj5.symbols.txt 2025-05-07T20:10:36.7239976Z 2025-05-07T20:10:36.7581051Z 2025-05-07T20:10:36.7614269Z [CHECK] Total Number of symbols: 6648 2025-05-07T20:10:36.7638572Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:10:36.7655627Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.BWlkeCSqAu.usymbols.txt 2025-05-07T20:10:36.7655739Z 2025-05-07T20:10:36.7695629Z 2025-05-07T20:10:36.7722037Z [CHECK] Listing out undefined symbols (465 total): 2025-05-07T20:10:36.7739022Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.7739434Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.7739549Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:36.7739653Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:10:36.7739818Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.7739964Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:36.7740100Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.7740254Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:36.7740387Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:36.7740511Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:36.7740642Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:36.7740772Z U 
__cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:36.7740881Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:36.7740992Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:36.7741269Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:36.7741378Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:36.7741485Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:36.7741617Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:36.7759403Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:36.7759621Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:36.7759912Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:36.7760068Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:36.7760182Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:36.7760304Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:36.7760445Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:36.7760554Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:36.7760760Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:36.7760928Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:36.7761059Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:36.7761223Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:36.7761347Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:10:36.7761492Z U at::SplitUntil32Bit::end() const 2025-05-07T20:10:36.7761650Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:10:36.7761792Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:10:36.7762110Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:36.7762311Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:36.7762562Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:10:36.7762758Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:10:36.7762906Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:10:36.7763054Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:10:36.7763203Z U 
at::TensorIteratorBase::numel() const 2025-05-07T20:10:36.7763362Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:10:36.7763591Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:10:36.7763831Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:10:36.7763953Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:36.7764104Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:10:36.7764272Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:10:36.7764515Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7764744Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7764889Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:36.7765344Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:10:36.7765559Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.7765754Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:10:36.7765967Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:10:36.7766147Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7766383Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:36.7766572Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:36.7766763Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:36.7767007Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:10:36.7767346Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:10:36.7767554Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7768151Z U 
at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7768793Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7768990Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:36.7769186Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:36.7769320Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:10:36.7769861Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7770067Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7770552Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:10:36.7770827Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:36.7770996Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:10:36.7771170Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.7771385Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:36.7771560Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.7772128Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7772335Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.7772850Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7773051Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.7773373Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:36.7773554Z U at::_ops::select_int::call(at::Tensor const&, 
long, c10::SymInt) 2025-05-07T20:10:36.7774014Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.7774373Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:36.7774523Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:10:36.7774779Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:36.7774932Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:10:36.7775209Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:36.7775423Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:10:36.7775682Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:36.7775993Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:36.7776757Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:36.7776934Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:10:36.7777194Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:10:36.7777356Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:36.7777535Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:36.7777703Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:36.7777850Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:36.7778324Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7778957Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 
2025-05-07T20:10:36.7779276Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:10:36.7779415Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:10:36.7779578Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:36.7779719Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:36.7779869Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:10:36.7780209Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:10:36.7780339Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:36.7780488Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:36.7780591Z U at::get_num_threads() 2025-05-07T20:10:36.7780696Z U at::get_thread_num() 2025-05-07T20:10:36.7780903Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:10:36.7781015Z U at::internal::set_thread_num(int) 2025-05-07T20:10:36.7781254Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:10:36.7781813Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7782423Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:36.7782688Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:36.7782826Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:10:36.7782955Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:36.7783119Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:36.7783232Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:36.7783342Z U bool at::Tensor::item() const 2025-05-07T20:10:36.7783478Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7783624Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7783716Z U c10::AnyType::get() 
2025-05-07T20:10:36.7783906Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:36.7784073Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7784268Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7784366Z U c10::BoolType::get() 2025-05-07T20:10:36.7784470Z U c10::DeviceObjType::get() 2025-05-07T20:10:36.7784625Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:36.7784802Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:36.7784912Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:36.7785409Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:36.7786021Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:36.7786423Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.7786520Z U c10::Error::what() const 2025-05-07T20:10:36.7786619Z U c10::FloatType::get() 2025-05-07T20:10:36.7786741Z U c10::GradMode::is_enabled() 2025-05-07T20:10:36.7786846Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:36.7786999Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7787165Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7787312Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:36.7787428Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:36.7787533Z U c10::IValue::isBoolList() const 2025-05-07T20:10:36.7787635Z U c10::IValue::isIntList() const 2025-05-07T20:10:36.7787749Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:36.7787854Z U c10::IValue::isTensorList() const 2025-05-07T20:10:36.7787989Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:36.7788098Z U c10::InferenceMode::is_enabled() 2025-05-07T20:10:36.7788190Z U 
c10::IntType::get() 2025-05-07T20:10:36.7788846Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.7789003Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:36.7789110Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:36.7789220Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.7789331Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:36.7789545Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.7789660Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:36.7789767Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:36.7789873Z U c10::ScalarTypeType::get() 2025-05-07T20:10:36.7790125Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:36.7790436Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:10:36.7790585Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:36.7790679Z U c10::StringType::get() 2025-05-07T20:10:36.7790813Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:36.7790972Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:36.7791104Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:36.7791465Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:36.7791595Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:36.7791721Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:10:36.7791840Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:36.7791978Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:36.7792081Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:36.7792197Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:36.7792323Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:36.7792423Z U 
c10::SymInt::toSymNode() const 2025-05-07T20:10:36.7792510Z U c10::SymIntType::get() 2025-05-07T20:10:36.7792680Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:36.7792786Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:36.7793180Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:36.7793356Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:36.7793450Z U c10::TensorType::get() 2025-05-07T20:10:36.7794148Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:36.7794326Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:36.7794424Z U c10::Type::is_module() const 2025-05-07T20:10:36.7794540Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:36.7795188Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:36.7795313Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:36.7795468Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:36.7795717Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:36.7796019Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:36.7796287Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:36.7796407Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:36.7796511Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:36.7796622Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:36.7796732Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:36.7796961Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:36.7797060Z U c10::cuda::current_device() 2025-05-07T20:10:36.7797159Z U c10::cuda::device_count() 2025-05-07T20:10:36.7797312Z U c10::cuda::getCurrentCUDAStream(signed 
char) 2025-05-07T20:10:36.7797437Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:36.7797573Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:36.7797702Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:36.7798038Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:36.7798146Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:36.7798567Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:36.7799069Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:36.7799318Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:36.7799807Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.7800145Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:36.7800734Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:36.7800998Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:36.7801256Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:36.7801473Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:36.7801587Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:36.7801688Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:36.7801998Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:36.7802175Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:36.7802298Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:10:36.7802424Z U c10::impl::PyObjectSlot::~PyObjectSlot() 
2025-05-07T20:10:36.7802566Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:36.7802724Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:36.7802839Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:36.7802952Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.7803094Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:36.7803461Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:36.7803586Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:10:36.7803706Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:36.7803863Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:10:36.7803981Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:36.7804122Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:10:36.7804255Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:10:36.7804372Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:36.7804513Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:36.7804651Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:36.7804829Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:36.7804966Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:36.7805094Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:10:36.7805211Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:36.7805353Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:36.7805481Z U c10::report_overflow(char const*) 2025-05-07T20:10:36.7805593Z U c10::throwNullDataPtrError() 2025-05-07T20:10:36.7805714Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:36.7805833Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:36.7805943Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:36.7806136Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 
2025-05-07T20:10:36.7806255Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:36.7806359Z U ceil@GLIBC_2.2.5 2025-05-07T20:10:36.7806468Z U cublasGemmStridedBatchedEx 2025-05-07T20:10:36.7806566Z U cublasSetStream_v2 2025-05-07T20:10:36.7806704Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:36.7806831Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:10:36.7806957Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:36.7807122Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:36.7807236Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:36.7807357Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:36.7807468Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:36.7807618Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:36.7807739Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:36.7807844Z U cudaFree@libcudart.so.12 2025-05-07T20:10:36.7807981Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:36.7808101Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:36.7808209Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:36.7808338Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:36.7808475Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:36.7808593Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:36.7808707Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:36.7808852Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:10:36.7808962Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:10:36.7809080Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:10:36.7809204Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:36.7809324Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:10:36.7809433Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:10:36.7809565Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:10:36.7809685Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:10:36.7809798Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:36.7809911Z U 
cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:36.7810199Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:36.7810322Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:36.7810429Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:36.7810550Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:36.7810675Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:36.7810822Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:36.7811020Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7811222Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7811316Z U exit@GLIBC_2.2.5 2025-05-07T20:10:36.7811407Z U exp10@GLIBC_2.2.5 2025-05-07T20:10:36.7811511Z U exp2@GLIBC_2.2.5 2025-05-07T20:10:36.7811601Z U exp@GLIBC_2.2.5 2025-05-07T20:10:36.7811728Z U expf@GLIBC_2.2.5 2025-05-07T20:10:36.7811937Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:36.7812134Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:36.7812335Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:36.7812549Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:36.7812743Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:36.7812888Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7813060Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7813157Z U fmod@GLIBC_2.2.5 2025-05-07T20:10:36.7813243Z U free@GLIBC_2.2.5 2025-05-07T20:10:36.7813359Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:10:36.7813485Z U int at::Tensor::item() const 2025-05-07T20:10:36.7813775Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:36.7813902Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7814054Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7814146Z U isnan@GLIBC_2.2.5 
2025-05-07T20:10:36.7814265Z U lgamma@GLIBC_2.2.5 2025-05-07T20:10:36.7814370Z U llrint@GLIBC_2.2.5 2025-05-07T20:10:36.7814466Z U llround@GLIBC_2.2.5 2025-05-07T20:10:36.7814556Z U log10@GLIBC_2.2.5 2025-05-07T20:10:36.7814655Z U log2@GLIBC_2.2.5 2025-05-07T20:10:36.7814743Z U log@GLIBC_2.2.5 2025-05-07T20:10:36.7814831Z U logl@GLIBC_2.2.5 2025-05-07T20:10:36.7814947Z U long at::Tensor::item() const 2025-05-07T20:10:36.7815137Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:36.7815304Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:36.7815438Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7815598Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7815691Z U lrint@GLIBC_2.2.5 2025-05-07T20:10:36.7815787Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:36.7815890Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:36.7815984Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:36.7816073Z U memcpy@GLIBC_2.14 2025-05-07T20:10:36.7816165Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:36.7816268Z U memset@GLIBC_2.2.5 2025-05-07T20:10:36.7816431Z U nextafter@GLIBC_2.2.5 2025-05-07T20:10:36.7816534Z U nvmlDeviceGetCount_v2 2025-05-07T20:10:36.7816662Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:10:36.7816787Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:10:36.7816892Z U nvmlDeviceGetNvLinkState 2025-05-07T20:10:36.7816997Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:10:36.7817098Z U nvmlInit_v2 2025-05-07T20:10:36.7817212Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:36.7817351Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.7817488Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:36.7818956Z U pow@GLIBC_2.2.5 2025-05-07T20:10:36.7819050Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:36.7819222Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7819419Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7819509Z U sin@GLIBC_2.2.5 
2025-05-07T20:10:36.7819756Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:36.7819931Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:36.7820118Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:10:36.7820300Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:36.7820677Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:36.7821014Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:36.7821406Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.7821726Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:36.7822130Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:36.7822504Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:36.7822644Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:36.7822761Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:36.7822886Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:36.7822999Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:36.7823108Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:36.7823252Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:36.7823394Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.7823534Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.7823684Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:36.7823855Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:36.7823987Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 
2025-05-07T20:10:36.7824143Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:36.7824348Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:10:36.7824687Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:36.7824935Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:10:36.7825189Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:36.7825526Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:36.7825772Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:10:36.7826332Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.7826508Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:10:36.7826623Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:36.7826777Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:36.7826906Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:36.7827042Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:36.7827187Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:36.7827305Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:36.7827435Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:36.7827629Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:36.7827812Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.7828060Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:36.7828177Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:36.7828304Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:36.7828431Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:36.7828681Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:10:36.7828951Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 
2025-05-07T20:10:36.7829079Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:36.7829281Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:36.7829676Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:36.7829841Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:36.7829945Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:36.7830036Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:36.7830124Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:36.7830222Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:36.7830336Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:36.7830869Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:36.7831298Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.7831750Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:36.7831993Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:36.7832108Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:36.7832378Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:36.7832557Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:36.7832747Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:36.7832920Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:36.7833251Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:36.7833389Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:36.7833565Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:36.7833740Z U 
torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:36.7833877Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:36.7833984Z U torch::autograd::Node::metadata() 2025-05-07T20:10:36.7834120Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:36.7834345Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:36.7834614Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:36.7834757Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:36.7834950Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:36.7835151Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:36.7837574Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:36.7837717Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:36.7837886Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:36.7838038Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:36.7838182Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:36.7838564Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:36.7838894Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.7839249Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:36.7839441Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:10:36.7839554Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:10:36.7840068Z U 
transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:36.7840169Z U typeinfo for c10::Error 2025-05-07T20:10:36.7840266Z U typeinfo for c10::Type 2025-05-07T20:10:36.7840395Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.7840520Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:36.7840639Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:36.7840761Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:36.7840884Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:36.7841058Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:36.7841250Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:36.7841665Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:36.7842163Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:36.7842565Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:36.7843045Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:36.7843469Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:10:36.7843938Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:10:36.7844365Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:36.7844855Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 
2025-05-07T20:10:36.7845328Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:36.7845879Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:36.7846411Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:36.7846716Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:36.7846900Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:36.7847071Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:36.7847223Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.7847377Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:36.7847497Z U vtable for at::TensorIterator 2025-05-07T20:10:36.7847615Z U vtable for at::TensorIteratorBase 2025-05-07T20:10:36.7847715Z U vtable for c10::Error 2025-05-07T20:10:36.7847816Z U vtable for c10::ListType 2025-05-07T20:10:36.7848146Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:36.7848321Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:36.7848704Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:36.7848853Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:36.7849027Z U vtable for torch::autograd::Node 2025-05-07T20:10:36.7849203Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:36.7849324Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:36.7849432Z w _ITM_registerTMCloneTable 2025-05-07T20:10:36.7849538Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:36.7849638Z w __gmon_start__ 2025-05-07T20:10:36.7849732Z w __pthread_key_create 2025-05-07T20:10:36.7849842Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:36.7849952Z w 
pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:36.7850055Z w pthread_once 2025-05-07T20:10:36.7850197Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:36.7850368Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.7850399Z 2025-05-07T20:10:36.7850562Z linux-vdso.so.1 (0x00007ffdf3fb7000) 2025-05-07T20:10:36.7850662Z libc10.so => not found 2025-05-07T20:10:36.7850758Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7850850Z libc10_cuda.so => not found 2025-05-07T20:10:36.7850940Z libnccl.so.2 => not found 2025-05-07T20:10:36.7851039Z libcuda.so.1 => not found 2025-05-07T20:10:36.7851405Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fb1c4600000) 2025-05-07T20:10:36.7851988Z fbgemm_gpu_embedding_inplace_ops.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so (0x00007fb1c9a44000) 2025-05-07T20:10:36.7852509Z fbgemm_gpu_tbe_index_select.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so (0x00007fb1c2200000) 2025-05-07T20:10:36.7852956Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fb1c0a00000) 2025-05-07T20:10:36.7853452Z fbgemm_gpu_tbe_optimizers.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so (0x00007fb1c0000000) 2025-05-07T20:10:36.7853560Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7853650Z libtorch.so => not found 2025-05-07T20:10:36.7854180Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fb1bfe59000) 2025-05-07T20:10:36.7854666Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fb1bec00000) 
2025-05-07T20:10:36.7854760Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7854855Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7854949Z libcudart.so.12 => not found 2025-05-07T20:10:36.7855151Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb1be99c000) 2025-05-07T20:10:36.7855273Z libm.so.6 => /lib64/libm.so.6 (0x00007fb1c9965000) 2025-05-07T20:10:36.7855421Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb1c4bd2000) 2025-05-07T20:10:36.7855552Z libc.so.6 => /lib64/libc.so.6 (0x00007fb1be794000) 2025-05-07T20:10:36.7855677Z /lib64/ld-linux-x86-64.so.2 (0x00007fb1c9abd000) 2025-05-07T20:10:36.7855763Z libc10.so => not found 2025-05-07T20:10:36.7855866Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7855955Z libc10_cuda.so => not found 2025-05-07T20:10:36.7856045Z libnccl.so.2 => not found 2025-05-07T20:10:36.7856134Z libcuda.so.1 => not found 2025-05-07T20:10:36.7856572Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fb1c4589000) 2025-05-07T20:10:36.7856671Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7856761Z libtorch.so => not found 2025-05-07T20:10:36.7856866Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7856964Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7857054Z libtorch.so => not found 2025-05-07T20:10:36.7857141Z libc10.so => not found 2025-05-07T20:10:36.7857245Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7857333Z libc10_cuda.so => not found 2025-05-07T20:10:36.7857483Z libnccl.so.2 => not found 2025-05-07T20:10:36.7857584Z libcuda.so.1 => not found 2025-05-07T20:10:36.7857679Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7857775Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7857870Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7857972Z libcudart.so.12 => not found 2025-05-07T20:10:36.7858122Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fb1c4b7c000) 2025-05-07T20:10:36.7858208Z libc10.so => not found 2025-05-07T20:10:36.7858312Z libnvrtc.so.12 => not 
found 2025-05-07T20:10:36.7858401Z libc10_cuda.so => not found 2025-05-07T20:10:36.7858489Z libnccl.so.2 => not found 2025-05-07T20:10:36.7858577Z libcuda.so.1 => not found 2025-05-07T20:10:36.7858685Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7858776Z libtorch.so => not found 2025-05-07T20:10:36.7858870Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7859002Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7859095Z libcudart.so.12 => not found 2025-05-07T20:10:36.7859184Z libtorch.so => not found 2025-05-07T20:10:36.7859267Z libc10.so => not found 2025-05-07T20:10:36.7859368Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7859455Z libc10_cuda.so => not found 2025-05-07T20:10:36.7859544Z libnccl.so.2 => not found 2025-05-07T20:10:36.7859666Z libcuda.so.1 => not found 2025-05-07T20:10:36.7859762Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7859857Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7859953Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7860058Z libcudart.so.12 => not found 2025-05-07T20:10:36.7860150Z libtorch.so => not found 2025-05-07T20:10:36.7860235Z libc10.so => not found 2025-05-07T20:10:36.7860337Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7860428Z libc10_cuda.so => not found 2025-05-07T20:10:36.7860520Z libnccl.so.2 => not found 2025-05-07T20:10:36.7860608Z libcuda.so.1 => not found 2025-05-07T20:10:36.7860716Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7860809Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7860902Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7861006Z libcudart.so.12 => not found 2025-05-07T20:10:36.7861090Z libc10.so => not found 2025-05-07T20:10:36.7861178Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7861265Z libc10_cuda.so => not found 2025-05-07T20:10:36.7861367Z libnccl.so.2 => not found 2025-05-07T20:10:36.7861454Z libcuda.so.1 => not found 2025-05-07T20:10:36.7861547Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7861674Z libtorch.so => not found 
2025-05-07T20:10:36.7861766Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7861859Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7861959Z libcudart.so.12 => not found 2025-05-07T20:10:36.7862049Z libtorch.so => not found 2025-05-07T20:10:36.7862148Z libc10.so => not found 2025-05-07T20:10:36.7862290Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7862383Z libc10_cuda.so => not found 2025-05-07T20:10:36.7862474Z libnccl.so.2 => not found 2025-05-07T20:10:36.7862576Z libcuda.so.1 => not found 2025-05-07T20:10:36.7862671Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7862765Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7862874Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7862967Z libcudart.so.12 => not found 2025-05-07T20:10:36.7863061Z libtorch.so => not found 2025-05-07T20:10:36.7863149Z libc10.so => not found 2025-05-07T20:10:36.7863259Z libnvrtc.so.12 => not found 2025-05-07T20:10:36.7863352Z libc10_cuda.so => not found 2025-05-07T20:10:36.7863454Z libnccl.so.2 => not found 2025-05-07T20:10:36.7863579Z libcuda.so.1 => not found 2025-05-07T20:10:36.7863684Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:36.7863789Z libtorch_cpu.so => not found 2025-05-07T20:10:36.7863896Z libtorch_cuda.so => not found 2025-05-07T20:10:36.7864053Z librt.so.1 => /lib64/librt.so.1 (0x00007fb1c4b67000) 2025-05-07T20:10:36.7864225Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fb1c4b62000) 2025-05-07T20:10:36.7864233Z 2025-05-07T20:10:36.7864343Z [CHECK] Displaying ELF information: 2025-05-07T20:10:36.7864553Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:36.7864558Z 2025-05-07T20:10:36.7864562Z 2025-05-07T20:10:36.7864721Z Dynamic section at offset 0x48e4fa8 contains 47 entries: 2025-05-07T20:10:36.7864840Z Tag Type Name/Value 2025-05-07T20:10:36.7865044Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:36.7865247Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 
2025-05-07T20:10:36.7865444Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:36.7865650Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:36.7865842Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:36.7866029Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:36.7866321Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:36.7866550Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:36.7866762Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:36.7867015Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:36.7867233Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:36.7867426Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:36.7867666Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:36.7867896Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:36.7868094Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:36.7868294Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:36.7868506Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:36.7868702Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:36.7868886Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:36.7869092Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:36.7869299Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:36.7869507Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:36.7869719Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:10:36.7869938Z 
0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:36.7870057Z 0x000000000000000c (INIT) 0x1bb000 2025-05-07T20:10:36.7870368Z 0x000000000000000d (FINI) 0x75816c 2025-05-07T20:10:36.7870508Z 0x0000000000000019 (INIT_ARRAY) 0x48d6858 2025-05-07T20:10:36.7870641Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:10:36.7870759Z 0x000000000000001a (FINI_ARRAY) 0x48d6ce0 2025-05-07T20:10:36.7870890Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:36.7871005Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:36.7871118Z 0x0000000000000005 (STRTAB) 0x33248 2025-05-07T20:10:36.7871230Z 0x0000000000000006 (SYMTAB) 0xc2f0 2025-05-07T20:10:36.7871392Z 0x000000000000000a (STRSZ) 1276767 (bytes) 2025-05-07T20:10:36.7871508Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:36.7871628Z 0x0000000000000003 (PLTGOT) 0x48eafe8 2025-05-07T20:10:36.7871794Z 0x0000000000000002 (PLTRELSZ) 68808 (bytes) 2025-05-07T20:10:36.7871910Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:36.7872033Z 0x0000000000000017 (JMPREL) 0x1a9648 2025-05-07T20:10:36.7872177Z 0x0000000000000007 (RELA) 0x16e320 2025-05-07T20:10:36.7872316Z 0x0000000000000008 (RELASZ) 242472 (bytes) 2025-05-07T20:10:36.7872445Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:36.7872570Z 0x000000006ffffffe (VERNEED) 0x16e1a0 2025-05-07T20:10:36.7872705Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:36.7872829Z 0x000000006ffffff0 (VERSYM) 0x16ada8 2025-05-07T20:10:36.7872950Z 0x000000006ffffff9 (RELACOUNT) 2870 2025-05-07T20:10:36.7873078Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:36.7873083Z 2025-05-07T20:10:36.7873213Z ################################################################################ 2025-05-07T20:10:36.7873218Z 2025-05-07T20:10:36.7873223Z 2025-05-07T20:10:36.7873348Z ################################################################################ 2025-05-07T20:10:36.7873745Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:36.7873857Z [CHECK] Listing out library size: 2025-05-07T20:10:36.7874162Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:36.7874167Z 2025-05-07T20:10:36.7874445Z 904 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:36.7874485Z 2025-05-07T20:10:36.7874911Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:36.7875436Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.7875441Z 2025-05-07T20:10:36.9817064Z GLIBC_2.2.5 2025-05-07T20:10:36.9817456Z GLIBC_2.3 2025-05-07T20:10:36.9817796Z GLIBC_2.14 2025-05-07T20:10:36.9818096Z 2025-05-07T20:10:36.9818107Z 2025-05-07T20:10:36.9818756Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:36.9819897Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:36.9820555Z 2025-05-07T20:10:37.1764977Z GLIBCXX_3.4 2025-05-07T20:10:37.1766530Z GLIBCXX_3.4.9 2025-05-07T20:10:37.1767283Z GLIBCXX_3.4.11 2025-05-07T20:10:37.1768048Z GLIBCXX_3.4.14 2025-05-07T20:10:37.1768315Z GLIBCXX_3.4.15 2025-05-07T20:10:37.1768553Z GLIBCXX_3.4.18 2025-05-07T20:10:37.1768808Z GLIBCXX_3.4.20 2025-05-07T20:10:37.1769042Z GLIBCXX_3.4.21 2025-05-07T20:10:37.1769173Z 2025-05-07T20:10:37.1769179Z 2025-05-07T20:10:37.1785220Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.xBynrJyZBO.symbols.txt 2025-05-07T20:10:37.1785762Z 
2025-05-07T20:10:37.3713605Z 2025-05-07T20:10:37.3797017Z [CHECK] Total Number of symbols: 12682 2025-05-07T20:10:37.3938980Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:10:37.3961497Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.uSgsTNbZxa.usymbols.txt 2025-05-07T20:10:37.3962075Z 2025-05-07T20:10:37.4037842Z 2025-05-07T20:10:37.4068578Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:10:37.4085319Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.4086289Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.4086841Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:37.4087261Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.4087670Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.4088098Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.4088499Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:37.4088875Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:37.4089256Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:37.4089625Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.4090016Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:37.4090351Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:37.4090686Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:37.4091020Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:37.4091339Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:37.4091677Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:37.4092000Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:37.4092537Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:37.4092850Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:37.4093181Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:37.4093496Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:37.4093826Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:37.4094323Z U 
__tls_get_addr@GLIBC_2.3 2025-05-07T20:10:37.4094825Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:37.4095218Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:37.4095613Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:37.4096020Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:37.4096506Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:37.4097055Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:37.4097531Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:37.4097971Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:37.4098600Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:37.4099182Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:37.4100109Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.4101424Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.4102438Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:37.4103582Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.4104703Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:37.4105268Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:37.4105684Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:37.4106533Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.4107590Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, 
std::optional, std::optional, std::optional) 2025-05-07T20:10:37.4108387Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:37.4108787Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:37.4109130Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:37.4109510Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:37.4109854Z U at::get_thread_num() 2025-05-07T20:10:37.4110151Z U at::globalContext() 2025-05-07T20:10:37.4110463Z U at::internal::set_thread_num(int) 2025-05-07T20:10:37.4110788Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:37.4111186Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:37.4111584Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:37.4111924Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:37.4112225Z U c10::AnyType::get() 2025-05-07T20:10:37.4112611Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.4113017Z U c10::BoolType::get() 2025-05-07T20:10:37.4113350Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:37.4113784Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:37.4114191Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:37.4114884Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:37.4116029Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:37.4117038Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:37.4117592Z U c10::Error::what() const 2025-05-07T20:10:37.4117892Z U c10::FloatType::get() 2025-05-07T20:10:37.4118190Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:37.4118511Z U c10::GradMode::is_enabled() 2025-05-07T20:10:37.4118811Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:37.4119248Z U c10::Half* 
at::TensorBase::data_ptr() const 2025-05-07T20:10:37.4119659Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.4120089Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:37.4120466Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:37.4120804Z U c10::IValue::isBoolList() const 2025-05-07T20:10:37.4121131Z U c10::IValue::isIntList() const 2025-05-07T20:10:37.4121440Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:37.4121776Z U c10::IValue::isTensorList() const 2025-05-07T20:10:37.4122140Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:37.4122476Z U c10::IntType::get() 2025-05-07T20:10:37.4122832Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:37.4123207Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:37.4123554Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:37.4124065Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:37.4124518Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:37.4124998Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:37.4125346Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:37.4125850Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:37.4126384Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.4126767Z U c10::StringType::get() 2025-05-07T20:10:37.4127128Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:37.4127526Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:37.4128192Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:37.4128821Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:37.4129204Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:37.4129555Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:37.4129981Z U c10::SymIntType::get() 2025-05-07T20:10:37.4130365Z U c10::SymbolicShapeMeta::init_is_contiguous() const 
2025-05-07T20:10:37.4130728Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:37.4131109Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.4131457Z U c10::TensorType::get() 2025-05-07T20:10:37.4131787Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:37.4132702Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:37.4133652Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:37.4133997Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:37.4134342Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:37.4134690Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:37.4135015Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:37.4135359Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:37.4135800Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:37.4136260Z U c10::cuda::device_count() 2025-05-07T20:10:37.4136678Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:37.4137258Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:37.4137801Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:37.4138190Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:37.4138617Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:37.4139025Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:37.4139693Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:37.4140760Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:37.4141625Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:37.4142495Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.4143438Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:37.4144453Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.4145264Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:37.4145625Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:37.4146162Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:37.4146789Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:37.4147239Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:37.4147685Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:37.4148096Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:37.4148439Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:37.4148954Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:37.4149570Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:37.4150308Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:37.4150682Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:37.4151047Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:37.4151445Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:37.4151863Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:37.4152231Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:10:37.4152561Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:37.4152906Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:37.4153245Z U c10::throwNullDataPtrError() 2025-05-07T20:10:37.4153552Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:37.4153876Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:37.4154260Z U 
caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:37.4154676Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:37.4155007Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:37.4155361Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:37.4155720Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:37.4156057Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:37.4156427Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:37.4156748Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:37.4157079Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:37.4157404Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:37.4157777Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:37.4158137Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:37.4158481Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:37.4158816Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:37.4159136Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:37.4159468Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:37.4159793Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:37.4160138Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:37.4161062Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:37.4162419Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:10:37.4162987Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:37.4163395Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:37.4163828Z U float at::Tensor::item() const 2025-05-07T20:10:37.4164201Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.4164605Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.4164974Z U free@GLIBC_2.2.5 2025-05-07T20:10:37.4165286Z U int* at::TensorBase::data_ptr() const 
2025-05-07T20:10:37.4165681Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.4166119Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:37.4166529Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.4166931Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.4167290Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:37.4167744Z U memcpy@GLIBC_2.14 2025-05-07T20:10:37.4168017Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:37.4168313Z U memset@GLIBC_2.2.5 2025-05-07T20:10:37.4168604Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:37.4168946Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:37.4169479Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4170385Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4171390Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4172137Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4172905Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4173682Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.4174205Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:37.4174862Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:10:37.4175914Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:10:37.4176611Z U sqrt@GLIBC_2.2.5 2025-05-07T20:10:37.4176918Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:37.4177365Z 
U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:37.4178089Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:37.4178939Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:37.4179754Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:37.4180581Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:37.4181196Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:37.4181548Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:37.4181937Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:37.4182338Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.4182855Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.4183255Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:37.4183675Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:37.4184055Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:37.4184516Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:37.4185393Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.4186159Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:37.4186503Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:37.4186860Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:37.4187192Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:37.4187564Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:37.4187937Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.4188448Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.4188911Z U 
std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:37.4189341Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:37.4189744Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:37.4190356Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:37.4190986Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:37.4191337Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:37.4191633Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:37.4191920Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:37.4192211Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:37.4192979Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:37.4194080Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.4194839Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.4195310Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:37.4195843Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:37.4196371Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:37.4196843Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:37.4197304Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:37.4197913Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:37.4198488Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:37.4199087Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:37.4199567Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:37.4199970Z U torch::autograd::Node::assign_parent() 
2025-05-07T20:10:37.4200501Z U torch::autograd::Node::metadata() 2025-05-07T20:10:37.4200881Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:37.4201371Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:37.4202011Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:37.4202525Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:37.4203002Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:37.4203560Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:37.4206490Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:37.4209622Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:37.4210035Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:37.4210482Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:37.4211054Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:37.4211704Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:37.4212580Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:37.4213592Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:37.4214516Z U typeinfo for c10::Error 2025-05-07T20:10:37.4214881Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:37.4215302Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:37.4215679Z U typeinfo for 
std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:37.4216053Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:37.4216527Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:37.4217831Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:37.4220005Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:37.4221297Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:37.4221736Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:37.4222180Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:37.4222601Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:37.4222990Z U vtable for c10::Error 2025-05-07T20:10:37.4223515Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.4224098Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:37.4224758Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:37.4225221Z U vtable for torch::autograd::Node 2025-05-07T20:10:37.4225635Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:37.4226033Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:37.4226369Z w _ITM_registerTMCloneTable 2025-05-07T20:10:37.4226684Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:37.4227000Z w __gmon_start__ 2025-05-07T20:10:37.4227281Z w __pthread_key_create 2025-05-07T20:10:37.4227602Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:37.4227977Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:37.4228341Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:37.4228838Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 
2025-05-07T20:10:37.4229191Z 2025-05-07T20:10:37.4229362Z linux-vdso.so.1 (0x00007fff757ed000) 2025-05-07T20:10:37.4229682Z libc10.so => not found 2025-05-07T20:10:37.4229943Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4230210Z libc10_cuda.so => not found 2025-05-07T20:10:37.4230499Z libnccl.so.2 => not found 2025-05-07T20:10:37.4230775Z libcuda.so.1 => not found 2025-05-07T20:10:37.4231398Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f2bc3800000) 2025-05-07T20:10:37.4232440Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f2bc3400000) 2025-05-07T20:10:37.4233539Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2bc3259000) 2025-05-07T20:10:37.4234299Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4234592Z libtorch.so => not found 2025-05-07T20:10:37.4235104Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f2bc2c00000) 2025-05-07T20:10:37.4236066Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2bc1a00000) 2025-05-07T20:10:37.4236721Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4237016Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4237292Z libcudart.so.12 => not found 2025-05-07T20:10:37.4237642Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2bc179c000) 2025-05-07T20:10:37.4238087Z libm.so.6 => /lib64/libm.so.6 (0x00007f2bfed81000) 2025-05-07T20:10:37.4238474Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2bfed53000) 2025-05-07T20:10:37.4238880Z libc.so.6 => /lib64/libc.so.6 (0x00007f2bc1594000) 2025-05-07T20:10:37.4239242Z /lib64/ld-linux-x86-64.so.2 (0x00007f2bfee66000) 2025-05-07T20:10:37.4239592Z libtorch.so => not found 
2025-05-07T20:10:37.4239846Z libc10.so => not found 2025-05-07T20:10:37.4240110Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4240378Z libc10_cuda.so => not found 2025-05-07T20:10:37.4240668Z libnccl.so.2 => not found 2025-05-07T20:10:37.4240919Z libcuda.so.1 => not found 2025-05-07T20:10:37.4241196Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4241494Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4241766Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4242042Z libcudart.so.12 => not found 2025-05-07T20:10:37.4242357Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2bc4faa000) 2025-05-07T20:10:37.4242714Z libc10.so => not found 2025-05-07T20:10:37.4242955Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4243228Z libc10_cuda.so => not found 2025-05-07T20:10:37.4243488Z libnccl.so.2 => not found 2025-05-07T20:10:37.4243753Z libcuda.so.1 => not found 2025-05-07T20:10:37.4244368Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f2bfed42000) 2025-05-07T20:10:37.4245007Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4245289Z libtorch.so => not found 2025-05-07T20:10:37.4245545Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4245839Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4246102Z libcudart.so.12 => not found 2025-05-07T20:10:37.4246373Z libc10.so => not found 2025-05-07T20:10:37.4246614Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4246890Z libc10_cuda.so => not found 2025-05-07T20:10:37.4247149Z libnccl.so.2 => not found 2025-05-07T20:10:37.4247412Z libcuda.so.1 => not found 2025-05-07T20:10:37.4247680Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4247949Z libtorch.so => not found 2025-05-07T20:10:37.4248219Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4250460Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4250737Z libcudart.so.12 => not found 2025-05-07T20:10:37.4250994Z libc10.so => not found 2025-05-07T20:10:37.4251357Z libnvrtc.so.12 => not 
found 2025-05-07T20:10:37.4251601Z libc10_cuda.so => not found 2025-05-07T20:10:37.4251860Z libnccl.so.2 => not found 2025-05-07T20:10:37.4252093Z libcuda.so.1 => not found 2025-05-07T20:10:37.4252630Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f2bc4f33000) 2025-05-07T20:10:37.4253178Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4253435Z libtorch.so => not found 2025-05-07T20:10:37.4253695Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4253944Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4254205Z libtorch.so => not found 2025-05-07T20:10:37.4254432Z libc10.so => not found 2025-05-07T20:10:37.4254677Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4254921Z libc10_cuda.so => not found 2025-05-07T20:10:37.4255362Z libnccl.so.2 => not found 2025-05-07T20:10:37.4255617Z libcuda.so.1 => not found 2025-05-07T20:10:37.4255885Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4256166Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4256519Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4256979Z libcudart.so.12 => not found 2025-05-07T20:10:37.4257361Z libtorch.so => not found 2025-05-07T20:10:37.4257675Z libc10.so => not found 2025-05-07T20:10:37.4257921Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4258198Z libc10_cuda.so => not found 2025-05-07T20:10:37.4258499Z libnccl.so.2 => not found 2025-05-07T20:10:37.4258764Z libcuda.so.1 => not found 2025-05-07T20:10:37.4259020Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4259300Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4259586Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4259960Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2bfed2d000) 2025-05-07T20:10:37.4260354Z libtorch.so => not found 2025-05-07T20:10:37.4260600Z libc10.so => not found 2025-05-07T20:10:37.4260854Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.4261111Z libc10_cuda.so => not found 2025-05-07T20:10:37.4261385Z libnccl.so.2 => not found 
2025-05-07T20:10:37.4261636Z libcuda.so.1 => not found 2025-05-07T20:10:37.4261909Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.4262172Z libtorch_cpu.so => not found 2025-05-07T20:10:37.4262450Z libtorch_cuda.so => not found 2025-05-07T20:10:37.4262778Z librt.so.1 => /lib64/librt.so.1 (0x00007f2bfed24000) 2025-05-07T20:10:37.4263019Z 2025-05-07T20:10:37.4263132Z [CHECK] Displaying ELF information: 2025-05-07T20:10:37.4263607Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:37.4263987Z 2025-05-07T20:10:37.4263991Z 2025-05-07T20:10:37.4264152Z Dynamic section at offset 0x38775ba0 contains 45 entries: 2025-05-07T20:10:37.4264554Z Tag Type Name/Value 2025-05-07T20:10:37.4264972Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:37.4265491Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:37.4266010Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:37.4266505Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:37.4267014Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:37.4267535Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:37.4268094Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:37.4268700Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:37.4269263Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:37.4269792Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:37.4270524Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:37.4271057Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:37.4271668Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:37.4272207Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cuda.so] 2025-05-07T20:10:37.4272816Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:37.4273330Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:37.4273850Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:37.4274346Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:37.4274856Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:37.4275368Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:37.4275971Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:37.4276536Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:37.4276945Z 0x000000000000000c (INIT) 0x652000 2025-05-07T20:10:37.4277308Z 0x000000000000000d (FINI) 0x2f6443c 2025-05-07T20:10:37.4277656Z 0x0000000000000019 (INIT_ARRAY) 0x3871d880 2025-05-07T20:10:37.4278038Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:10:37.4278401Z 0x000000000000001a (FINI_ARRAY) 0x3871dfa0 2025-05-07T20:10:37.4278816Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:37.4279182Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:37.4279514Z 0x0000000000000005 (STRTAB) 0x62978 2025-05-07T20:10:37.4279872Z 0x0000000000000006 (SYMTAB) 0x18470 2025-05-07T20:10:37.4280281Z 0x000000000000000a (STRSZ) 5120077 (bytes) 2025-05-07T20:10:37.4280666Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:37.4281011Z 0x0000000000000003 (PLTGOT) 0x38788fe8 2025-05-07T20:10:37.4281390Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:10:37.4281735Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:37.4282077Z 0x0000000000000017 (JMPREL) 0x641978 2025-05-07T20:10:37.4282428Z 0x0000000000000007 (RELA) 0x54ae50 2025-05-07T20:10:37.4282901Z 0x0000000000000008 (RELASZ) 1010472 (bytes) 2025-05-07T20:10:37.4283270Z 0x0000000000000009 (RELAENT) 24 (bytes) 
2025-05-07T20:10:37.4283606Z 0x000000006ffffffe (VERNEED) 0x54ace0 2025-05-07T20:10:37.4283950Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:37.4284264Z 0x000000006ffffff0 (VERSYM) 0x5449c6 2025-05-07T20:10:37.4284603Z 0x000000006ffffff9 (RELACOUNT) 28262 2025-05-07T20:10:37.4284915Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:37.4285131Z 2025-05-07T20:10:37.4285247Z ################################################################################ 2025-05-07T20:10:37.4285468Z 2025-05-07T20:10:37.4285472Z 2025-05-07T20:10:37.4285603Z ################################################################################ 2025-05-07T20:10:37.4286138Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.4286680Z [CHECK] Listing out library size: 2025-05-07T20:10:37.4287164Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.4287585Z 2025-05-07T20:10:37.4287826Z 142 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.4288177Z 2025-05-07T20:10:37.4288608Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.4289632Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.4290333Z 2025-05-07T20:10:37.4499418Z GLIBC_2.2.5 2025-05-07T20:10:37.4500045Z GLIBC_2.3 2025-05-07T20:10:37.4500581Z GLIBC_2.14 2025-05-07T20:10:37.4500930Z 2025-05-07T20:10:37.4500965Z 2025-05-07T20:10:37.4502312Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.4505659Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep 
GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.4506432Z 2025-05-07T20:10:37.4776241Z GLIBCXX_3.4 2025-05-07T20:10:37.4777097Z GLIBCXX_3.4.9 2025-05-07T20:10:37.4777712Z GLIBCXX_3.4.11 2025-05-07T20:10:37.4778292Z GLIBCXX_3.4.18 2025-05-07T20:10:37.4778911Z GLIBCXX_3.4.20 2025-05-07T20:10:37.4779471Z GLIBCXX_3.4.21 2025-05-07T20:10:37.4779830Z 2025-05-07T20:10:37.4779869Z 2025-05-07T20:10:37.4797152Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.P4826Ha1TT.symbols.txt 2025-05-07T20:10:37.4798690Z 2025-05-07T20:10:37.5035714Z 2025-05-07T20:10:37.5060602Z [CHECK] Total Number of symbols: 1629 2025-05-07T20:10:37.5084610Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:10:37.5100866Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.MI9llrHnsA.usymbols.txt 2025-05-07T20:10:37.5102432Z 2025-05-07T20:10:37.5123304Z 2025-05-07T20:10:37.5152315Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:37.5168819Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.5171890Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.5173621Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:37.5174650Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.5175370Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.5175779Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.5176161Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:37.5176664Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:37.5177020Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:37.5177463Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.5177828Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:37.5178145Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:37.5178480Z U 
__cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:37.5178795Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:37.5179127Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:37.5179457Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:37.5179825Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:37.5180178Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:37.5180622Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:37.5181088Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:37.5181532Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:37.5182032Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:37.5182910Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5184256Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5185340Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:37.5185955Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:37.5186938Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5188118Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5189098Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:37.5189501Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:37.5189860Z U at::globalContext() 2025-05-07T20:10:37.5190259Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5190690Z U c10::BoolType::get() 2025-05-07T20:10:37.5191043Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:37.5191429Z U c10::FloatType::get() 2025-05-07T20:10:37.5191768Z U 
c10::GeneratorImpl::device() const 2025-05-07T20:10:37.5192206Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5192640Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:37.5192987Z U c10::IntType::get() 2025-05-07T20:10:37.5193371Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:37.5193803Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:37.5194192Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.5207517Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:37.5207931Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:37.5208603Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:37.5209258Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:37.5209620Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:37.5209956Z U c10::SymIntType::get() 2025-05-07T20:10:37.5210310Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:37.5210741Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.5211428Z U c10::TensorType::get() 2025-05-07T20:10:37.5211943Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:37.5212895Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:37.5213845Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:37.5214222Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:37.5214583Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:37.5214927Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:37.5215284Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:37.5215628Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:37.5216112Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:37.5216714Z U c10::cuda::device_count() 2025-05-07T20:10:37.5217304Z U 
c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:37.5217789Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:37.5218179Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:37.5218584Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:37.5218990Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:37.5219426Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:37.5220168Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:37.5221030Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:37.5221887Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.5222821Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:37.5223831Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.5224637Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:37.5224973Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:37.5225391Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:37.5225834Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:37.5226233Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:37.5226648Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:37.5227039Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:37.5227401Z U c10::throwNullDataPtrError() 2025-05-07T20:10:37.5227741Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:37.5228110Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:37.5228529Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:37.5228983Z U caffe2::TypeMeta::typeMetaDatas() 
2025-05-07T20:10:37.5229360Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:37.5229724Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:37.5230102Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:37.5230473Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:37.5230822Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:37.5231178Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:37.5231514Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:37.5231876Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:37.5232223Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:37.5232602Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:37.5232970Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:37.5233323Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:37.5233670Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:37.5234014Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:37.5234372Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:37.5234729Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:37.5237062Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:37.5239541Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:37.5239992Z U float at::Tensor::item() const 2025-05-07T20:10:37.5240361Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.5240785Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5241177Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.5241579Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5242005Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:37.5242444Z U long* 
at::TensorBase::data_ptr() const 2025-05-07T20:10:37.5242973Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5243320Z U memcpy@GLIBC_2.14 2025-05-07T20:10:37.5243619Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:37.5243901Z U memset@GLIBC_2.2.5 2025-05-07T20:10:37.5244215Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:37.5244587Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:37.5245135Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.5245917Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.5246637Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.5247372Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.5248121Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:37.5248922Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:37.5249720Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:37.5250491Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:37.5251171Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:37.5251503Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:37.5251838Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.5252212Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.5252602Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:37.5253005Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:37.5253463Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 
2025-05-07T20:10:37.5254306Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.5255045Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:37.5255386Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:37.5255747Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:37.5256075Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:37.5256540Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.5257251Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.5257763Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:37.5258126Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:37.5258452Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:37.5258766Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:37.5259583Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:37.5260738Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.5261551Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.5262292Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:37.5263343Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:37.5265388Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5268259Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5271229Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5273977Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5276878Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5279592Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.5282978Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5286913Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5290806Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5294582Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5298403Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5302152Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.5305730Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:37.5307642Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:37.5308102Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:37.5308544Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:37.5309269Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.5309935Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:37.5310382Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:37.5310695Z w _ITM_registerTMCloneTable 2025-05-07T20:10:37.5311013Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:37.5311315Z w __gmon_start__ 2025-05-07T20:10:37.5311582Z w __pthread_key_create 2025-05-07T20:10:37.5312006Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:37.5312305Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:37.5312656Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:37.5313121Z + ldd 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.5313467Z 2025-05-07T20:10:37.5313628Z linux-vdso.so.1 (0x00007fff7dff8000) 2025-05-07T20:10:37.5313910Z libc10.so => not found 2025-05-07T20:10:37.5314157Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5314398Z libc10_cuda.so => not found 2025-05-07T20:10:37.5314659Z libnccl.so.2 => not found 2025-05-07T20:10:37.5314893Z libcuda.so.1 => not found 2025-05-07T20:10:37.5315786Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fadd8c00000) 2025-05-07T20:10:37.5316531Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5316804Z libtorch.so => not found 2025-05-07T20:10:37.5317082Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5317356Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5317630Z libcudart.so.12 => not found 2025-05-07T20:10:37.5317953Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fadd899c000) 2025-05-07T20:10:37.5318377Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fae1ba2e000) 2025-05-07T20:10:37.5318750Z libc.so.6 => /lib64/libc.so.6 (0x00007fadd8794000) 2025-05-07T20:10:37.5319110Z /lib64/ld-linux-x86-64.so.2 (0x00007fae1ba64000) 2025-05-07T20:10:37.5319426Z libc10.so => not found 2025-05-07T20:10:37.5319676Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5319929Z libc10_cuda.so => not found 2025-05-07T20:10:37.5320198Z libnccl.so.2 => not found 2025-05-07T20:10:37.5320457Z libcuda.so.1 => not found 2025-05-07T20:10:37.5321061Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fadd7000000) 2025-05-07T20:10:37.5322067Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fadd6c00000) 2025-05-07T20:10:37.5323137Z fbgemm_gpu_sparse_async_cumsum.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fadd6a59000) 2025-05-07T20:10:37.5323861Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5324137Z libtorch.so => not found 2025-05-07T20:10:37.5324636Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fadd6400000) 2025-05-07T20:10:37.5325533Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fadd5200000) 2025-05-07T20:10:37.5326166Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5326447Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5326710Z libcudart.so.12 => not found 2025-05-07T20:10:37.5327008Z libm.so.6 => /lib64/libm.so.6 (0x00007fae12925000) 2025-05-07T20:10:37.5327325Z libtorch.so => not found 2025-05-07T20:10:37.5327578Z libc10.so => not found 2025-05-07T20:10:37.5327860Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5328124Z libc10_cuda.so => not found 2025-05-07T20:10:37.5328364Z libnccl.so.2 => not found 2025-05-07T20:10:37.5328605Z libcuda.so.1 => not found 2025-05-07T20:10:37.5328846Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5329111Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5329483Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5329747Z libcudart.so.12 => not found 2025-05-07T20:10:37.5330047Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fadd873e000) 2025-05-07T20:10:37.5330355Z libc10.so => not found 2025-05-07T20:10:37.5330574Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5330799Z libc10_cuda.so => not found 2025-05-07T20:10:37.5331032Z libnccl.so.2 => not found 2025-05-07T20:10:37.5331250Z libcuda.so.1 => not found 2025-05-07T20:10:37.5331808Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fae1ba17000) 2025-05-07T20:10:37.5332404Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5332644Z 
libtorch.so => not found 2025-05-07T20:10:37.5332882Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5333123Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5333370Z libcudart.so.12 => not found 2025-05-07T20:10:37.5333598Z libc10.so => not found 2025-05-07T20:10:37.5333820Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5334049Z libc10_cuda.so => not found 2025-05-07T20:10:37.5334280Z libnccl.so.2 => not found 2025-05-07T20:10:37.5334502Z libcuda.so.1 => not found 2025-05-07T20:10:37.5334736Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5335006Z libtorch.so => not found 2025-05-07T20:10:37.5335236Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5335478Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5335706Z libcudart.so.12 => not found 2025-05-07T20:10:37.5335933Z libc10.so => not found 2025-05-07T20:10:37.5336139Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5336494Z libc10_cuda.so => not found 2025-05-07T20:10:37.5336722Z libnccl.so.2 => not found 2025-05-07T20:10:37.5337150Z libcuda.so.1 => not found 2025-05-07T20:10:37.5337650Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fadd6f89000) 2025-05-07T20:10:37.5338202Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5338465Z libtorch.so => not found 2025-05-07T20:10:37.5338702Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5338960Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5339205Z libtorch.so => not found 2025-05-07T20:10:37.5339437Z libc10.so => not found 2025-05-07T20:10:37.5339664Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5339913Z libc10_cuda.so => not found 2025-05-07T20:10:37.5340151Z libnccl.so.2 => not found 2025-05-07T20:10:37.5340388Z libcuda.so.1 => not found 2025-05-07T20:10:37.5340624Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5340880Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5341137Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5341385Z libcudart.so.12 => not found 
2025-05-07T20:10:37.5341634Z libtorch.so => not found 2025-05-07T20:10:37.5341860Z libc10.so => not found 2025-05-07T20:10:37.5342087Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5342326Z libc10_cuda.so => not found 2025-05-07T20:10:37.5342571Z libnccl.so.2 => not found 2025-05-07T20:10:37.5342808Z libcuda.so.1 => not found 2025-05-07T20:10:37.5343055Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5343309Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5343571Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5343905Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fae1ba02000) 2025-05-07T20:10:37.5344275Z libtorch.so => not found 2025-05-07T20:10:37.5344516Z libc10.so => not found 2025-05-07T20:10:37.5344743Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.5344991Z libc10_cuda.so => not found 2025-05-07T20:10:37.5345233Z libnccl.so.2 => not found 2025-05-07T20:10:37.5345476Z libcuda.so.1 => not found 2025-05-07T20:10:37.5345712Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.5346011Z libtorch_cpu.so => not found 2025-05-07T20:10:37.5346258Z libtorch_cuda.so => not found 2025-05-07T20:10:37.5346550Z librt.so.1 => /lib64/librt.so.1 (0x00007fae1b9f9000) 2025-05-07T20:10:37.5346781Z 2025-05-07T20:10:37.5346891Z [CHECK] Displaying ELF information: 2025-05-07T20:10:37.5347352Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:37.5347766Z 2025-05-07T20:10:37.5347771Z 2025-05-07T20:10:37.5347928Z Dynamic section at offset 0x8d68cc8 contains 40 entries: 2025-05-07T20:10:37.5348287Z Tag Type Name/Value 2025-05-07T20:10:37.5348687Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:37.5349284Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:37.5349772Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:37.5350251Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:37.5350719Z 0x0000000000000001 
(NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:37.5351257Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:37.5351800Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:37.5352287Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:37.5352764Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:37.5353280Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:37.5353769Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:37.5354248Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:37.5354761Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:37.5355221Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:37.5355709Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:37.5356272Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:10:37.5356806Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:37.5357308Z 0x000000000000000c (INIT) 0xbe000 2025-05-07T20:10:37.5357607Z 0x000000000000000d (FINI) 0x5f04ec 2025-05-07T20:10:37.5357926Z 0x0000000000000019 (INIT_ARRAY) 0x8d5ea18 2025-05-07T20:10:37.5358245Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:10:37.5358585Z 0x000000000000001a (FINI_ARRAY) 0x8d5eae0 2025-05-07T20:10:37.5358905Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:37.5359214Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:37.5359533Z 0x0000000000000005 (STRTAB) 0xc600 2025-05-07T20:10:37.5359831Z 0x0000000000000006 (SYMTAB) 0x2d30 2025-05-07T20:10:37.5360162Z 0x000000000000000a (STRSZ) 597451 (bytes) 2025-05-07T20:10:37.5360482Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:37.5360815Z 0x0000000000000003 (PLTGOT) 0x8d6afe8 
2025-05-07T20:10:37.5361141Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:10:37.5361462Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:37.5361758Z 0x0000000000000017 (JMPREL) 0xbab38 2025-05-07T20:10:37.5362055Z 0x0000000000000007 (RELA) 0x9f1a8 2025-05-07T20:10:37.5362384Z 0x0000000000000008 (RELASZ) 113040 (bytes) 2025-05-07T20:10:37.5362700Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:37.5363008Z 0x000000006ffffffe (VERNEED) 0x9f088 2025-05-07T20:10:37.5363299Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:37.5363598Z 0x000000006ffffff0 (VERSYM) 0x9e3cc 2025-05-07T20:10:37.5363924Z 0x000000006ffffff9 (RELACOUNT) 3303 2025-05-07T20:10:37.5364216Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:37.5364398Z 2025-05-07T20:10:37.5364514Z ################################################################################ 2025-05-07T20:10:37.5364719Z 2025-05-07T20:10:37.5364723Z 2025-05-07T20:10:37.5364824Z ################################################################################ 2025-05-07T20:10:37.5365428Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.5365914Z [CHECK] Listing out library size: 2025-05-07T20:10:37.5366375Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.5366755Z 2025-05-07T20:10:37.5366988Z 59 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.5367315Z 2025-05-07T20:10:37.5367716Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.5368705Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.5369292Z 2025-05-07T20:10:37.5445182Z GLIBC_2.2.5 
2025-05-07T20:10:37.5445814Z GLIBC_2.3 2025-05-07T20:10:37.5446427Z GLIBC_2.14 2025-05-07T20:10:37.5446755Z 2025-05-07T20:10:37.5446769Z 2025-05-07T20:10:37.5448220Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.5449562Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.5450168Z 2025-05-07T20:10:37.5614012Z GLIBCXX_3.4 2025-05-07T20:10:37.5614644Z GLIBCXX_3.4.9 2025-05-07T20:10:37.5615289Z GLIBCXX_3.4.11 2025-05-07T20:10:37.5615861Z GLIBCXX_3.4.15 2025-05-07T20:10:37.5616720Z GLIBCXX_3.4.18 2025-05-07T20:10:37.5617289Z GLIBCXX_3.4.20 2025-05-07T20:10:37.5617720Z GLIBCXX_3.4.21 2025-05-07T20:10:37.5617847Z 2025-05-07T20:10:37.5617852Z 2025-05-07T20:10:37.5634484Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.QILNXzIhBx.symbols.txt 2025-05-07T20:10:37.5635504Z 2025-05-07T20:10:37.5758418Z 2025-05-07T20:10:37.5799617Z [CHECK] Total Number of symbols: 1874 2025-05-07T20:10:37.5819244Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:10:37.5840471Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.IYARmF1xmM.usymbols.txt 2025-05-07T20:10:37.5841101Z 2025-05-07T20:10:37.5869806Z 2025-05-07T20:10:37.5902826Z [CHECK] Listing out undefined symbols (259 total): 2025-05-07T20:10:37.5923359Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.5924236Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.5924791Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:37.5925244Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.5925799Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.5926206Z U 
__cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.5926573Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:37.5926943Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:37.5927282Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:37.5927645Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.5927992Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:37.5928319Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:37.5929928Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:37.5930248Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:37.5930565Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:37.5930885Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:37.5931214Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:37.5931604Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:37.5931922Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:37.5932335Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:37.5932642Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:37.5932930Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:37.5933238Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:37.5933543Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:37.5933873Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:37.5934287Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:37.5934661Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:37.5935045Z U at::RecordFunction::end() 2025-05-07T20:10:37.5935365Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:37.5935727Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:37.5936128Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:37.5936916Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:37.5937769Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5939150Z U 
at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5940185Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:37.5940942Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5942075Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.5942997Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:37.5943348Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:37.5943711Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:37.5944239Z U at::globalContext() 2025-05-07T20:10:37.5944555Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:37.5944884Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:37.5945157Z U c10::AnyType::get() 2025-05-07T20:10:37.5945554Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5945972Z U c10::BoolType::get() 2025-05-07T20:10:37.5946321Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:37.5946774Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:37.5947170Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:37.5947882Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:37.5949077Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:37.5950168Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:37.5950743Z U c10::Error::what() const 2025-05-07T20:10:37.5951055Z U c10::FloatType::get() 2025-05-07T20:10:37.5951390Z U c10::GradMode::is_enabled() 2025-05-07T20:10:37.5951717Z U c10::GradMode::set_enabled(bool) 
2025-05-07T20:10:37.5952124Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.5952589Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:37.5952974Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:37.5953342Z U c10::IValue::isBoolList() const 2025-05-07T20:10:37.5953694Z U c10::IValue::isIntList() const 2025-05-07T20:10:37.5954208Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:37.5954681Z U c10::IValue::isTensorList() const 2025-05-07T20:10:37.5955044Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:37.5955427Z U c10::IntType::get() 2025-05-07T20:10:37.5955790Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:37.5956221Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:37.5956646Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:37.5957002Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:37.5957475Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:37.5958346Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:37.5958922Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.5959329Z U c10::StringType::get() 2025-05-07T20:10:37.5959682Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:37.5960111Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:37.5960533Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:37.5960988Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:37.5961431Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:37.5962082Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:37.5962751Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:37.5963127Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:37.5963623Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:37.5963998Z 
U c10::SymInt::promote_to_negative() 2025-05-07T20:10:37.5964331Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:37.5964701Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:37.5965040Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:37.5965398Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:37.5965696Z U c10::SymIntType::get() 2025-05-07T20:10:37.5966055Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:37.5966446Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:37.5966809Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.5967177Z U c10::TensorType::get() 2025-05-07T20:10:37.5967485Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:37.5968414Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:37.5969320Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:37.5969660Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:37.5970049Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:37.5970716Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:37.5971199Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:37.5971630Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:37.5972103Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:37.5972595Z U c10::cuda::device_count() 2025-05-07T20:10:37.5972953Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:37.5973365Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:37.5973771Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:37.5974188Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:37.5974628Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:37.5975028Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:37.5975757Z U 
c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:37.5976921Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:37.5977858Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:37.5978741Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.5979702Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:37.5980717Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.5981535Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:37.5981878Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:37.5982429Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:37.5983062Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:37.5983512Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:37.5983971Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:37.5984372Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:37.5984733Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:37.5985136Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:37.5985775Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:37.5986393Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:37.5986774Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:37.5987187Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:37.5987619Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:37.5988098Z U c10::operator<<(std::ostream&, 
c10::SymFloat const&) 2025-05-07T20:10:37.5988522Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:37.5988897Z U c10::throwNullDataPtrError() 2025-05-07T20:10:37.5989248Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:37.5989588Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:37.5990061Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:37.5990508Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:37.5990877Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:37.5991264Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:37.5991647Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:37.5992032Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:37.5992394Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:37.5992766Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:37.5993122Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:37.5993479Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:37.5993863Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:37.5994245Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:37.5994644Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:37.5995002Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:37.5995419Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:37.5995783Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:37.5996141Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:37.5996517Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:37.5998901Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:37.6001483Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 
2025-05-07T20:10:37.6001982Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.6002410Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.6002780Z U free@GLIBC_2.2.5 2025-05-07T20:10:37.6003125Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.6003535Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.6003961Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:37.6004407Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.6004800Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.6005277Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:37.6005559Z U memcpy@GLIBC_2.14 2025-05-07T20:10:37.6005857Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:37.6006144Z U memset@GLIBC_2.2.5 2025-05-07T20:10:37.6006457Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:37.6006815Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:37.6007345Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.6008072Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.6008609Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:37.6009024Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:37.6009679Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:37.6010487Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:37.6011269Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:37.6012039Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:37.6012835Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:37.6013420Z U 
std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:37.6013745Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:37.6014126Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.6014509Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.6014909Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:37.6015347Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:37.6015715Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:37.6016188Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:37.6017343Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.6018146Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:37.6018528Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:37.6018888Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:37.6019263Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:37.6019637Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:37.6020047Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.6020606Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.6021086Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:37.6021524Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:37.6021971Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:37.6022636Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:37.6023336Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:37.6023699Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:37.6023829Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:37.6023933Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:37.6024061Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:37.6024665Z U 
torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:37.6025131Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.6025421Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.6025567Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:37.6025868Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:37.6026094Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:37.6026322Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:37.6026516Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:37.6026872Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:37.6027059Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:37.6027259Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:37.6027443Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:37.6027593Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:37.6027717Z U torch::autograd::Node::metadata() 2025-05-07T20:10:37.6027862Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:37.6028136Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:37.6028437Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:37.6028588Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:37.6028830Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:37.6029188Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:37.6031607Z U torch::autograd::_wrap_outputs(std::vector > 
const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:37.6031760Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:37.6031927Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:37.6032083Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:37.6032800Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:37.6032972Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:37.6033350Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:37.6033681Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:37.6034206Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:37.6034340Z U typeinfo for c10::Error 2025-05-07T20:10:37.6034496Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:37.6034627Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:37.6034759Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:37.6034911Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:37.6035049Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:37.6036340Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6037637Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6038895Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6040161Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6041371Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6042606Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:37.6042755Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:37.6042929Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:37.6043080Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:37.6043232Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:37.6043352Z U vtable for c10::Error 2025-05-07T20:10:37.6043663Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.6043795Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:37.6044033Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:37.6044157Z U vtable for torch::autograd::Node 2025-05-07T20:10:37.6044355Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:37.6044486Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:37.6044590Z w _ITM_registerTMCloneTable 2025-05-07T20:10:37.6044694Z w __cxa_finalize@GLIBC_2.2.5 
2025-05-07T20:10:37.6044808Z w __gmon_start__ 2025-05-07T20:10:37.6044943Z w __pthread_key_create 2025-05-07T20:10:37.6045048Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:37.6045156Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:37.6045318Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:37.6045568Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.6045575Z 2025-05-07T20:10:37.6045751Z linux-vdso.so.1 (0x00007ffdd31f1000) 2025-05-07T20:10:37.6045840Z libc10.so => not found 2025-05-07T20:10:37.6045938Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6046051Z libc10_cuda.so => not found 2025-05-07T20:10:37.6046142Z libnccl.so.2 => not found 2025-05-07T20:10:37.6046231Z libcuda.so.1 => not found 2025-05-07T20:10:37.6046777Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f9409000000) 2025-05-07T20:10:37.6046877Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6046966Z libtorch.so => not found 2025-05-07T20:10:37.6047084Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6047208Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6047302Z libcudart.so.12 => not found 2025-05-07T20:10:37.6047457Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9408d9c000) 2025-05-07T20:10:37.6047622Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f9446ae0000) 2025-05-07T20:10:37.6047789Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9446ab2000) 2025-05-07T20:10:37.6047910Z libc.so.6 => /lib64/libc.so.6 (0x00007f9408b94000) 2025-05-07T20:10:37.6048056Z /lib64/ld-linux-x86-64.so.2 (0x00007f9446b3e000) 2025-05-07T20:10:37.6048142Z libc10.so => not found 2025-05-07T20:10:37.6048239Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6048332Z libc10_cuda.so => not found 2025-05-07T20:10:37.6048451Z libnccl.so.2 => not found 2025-05-07T20:10:37.6048540Z libcuda.so.1 => not found 
2025-05-07T20:10:37.6048976Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f9407400000) 2025-05-07T20:10:37.6049432Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f9407000000) 2025-05-07T20:10:37.6049926Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f9406e59000) 2025-05-07T20:10:37.6050028Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6050153Z libtorch.so => not found 2025-05-07T20:10:37.6050481Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f9406800000) 2025-05-07T20:10:37.6050897Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9405600000) 2025-05-07T20:10:37.6051019Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6051119Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6051214Z libcudart.so.12 => not found 2025-05-07T20:10:37.6051337Z libm.so.6 => /lib64/libm.so.6 (0x00007f9442d25000) 2025-05-07T20:10:37.6051454Z libtorch.so => not found 2025-05-07T20:10:37.6051537Z libc10.so => not found 2025-05-07T20:10:37.6051628Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6051738Z libc10_cuda.so => not found 2025-05-07T20:10:37.6051827Z libnccl.so.2 => not found 2025-05-07T20:10:37.6051919Z libcuda.so.1 => not found 2025-05-07T20:10:37.6052019Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6052163Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6052272Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6052373Z libcudart.so.12 => not found 2025-05-07T20:10:37.6052488Z libc10.so => not found 2025-05-07T20:10:37.6052587Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6052685Z libc10_cuda.so => not found 2025-05-07T20:10:37.6052783Z libnccl.so.2 => 
not found 2025-05-07T20:10:37.6052903Z libcuda.so.1 => not found 2025-05-07T20:10:37.6053345Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f9446a9b000) 2025-05-07T20:10:37.6053450Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6053582Z libtorch.so => not found 2025-05-07T20:10:37.6053682Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6053783Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6053903Z libcudart.so.12 => not found 2025-05-07T20:10:37.6053999Z libc10.so => not found 2025-05-07T20:10:37.6054098Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6054195Z libc10_cuda.so => not found 2025-05-07T20:10:37.6054311Z libnccl.so.2 => not found 2025-05-07T20:10:37.6054407Z libcuda.so.1 => not found 2025-05-07T20:10:37.6054505Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6054612Z libtorch.so => not found 2025-05-07T20:10:37.6054706Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6054800Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6054898Z libcudart.so.12 => not found 2025-05-07T20:10:37.6055000Z libc10.so => not found 2025-05-07T20:10:37.6055094Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6055186Z libc10_cuda.so => not found 2025-05-07T20:10:37.6055321Z libnccl.so.2 => not found 2025-05-07T20:10:37.6055411Z libcuda.so.1 => not found 2025-05-07T20:10:37.6055741Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f9446a1c000) 2025-05-07T20:10:37.6055839Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6055983Z libtorch.so => not found 2025-05-07T20:10:37.6056081Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6056183Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6056307Z libtorch.so => not found 2025-05-07T20:10:37.6056458Z libc10.so => not found 2025-05-07T20:10:37.6056550Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6056646Z libc10_cuda.so => not found 2025-05-07T20:10:37.6056943Z libnccl.so.2 
=> not found 2025-05-07T20:10:37.6057039Z libcuda.so.1 => not found 2025-05-07T20:10:37.6057145Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6057276Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6057380Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6057531Z libcudart.so.12 => not found 2025-05-07T20:10:37.6057633Z libtorch.so => not found 2025-05-07T20:10:37.6057842Z libc10.so => not found 2025-05-07T20:10:37.6057942Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6058037Z libc10_cuda.so => not found 2025-05-07T20:10:37.6058169Z libnccl.so.2 => not found 2025-05-07T20:10:37.6058268Z libcuda.so.1 => not found 2025-05-07T20:10:37.6058369Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6058474Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6058601Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6058780Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f9446a0f000) 2025-05-07T20:10:37.6058877Z libtorch.so => not found 2025-05-07T20:10:37.6058992Z libc10.so => not found 2025-05-07T20:10:37.6059091Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.6059187Z libc10_cuda.so => not found 2025-05-07T20:10:37.6059284Z libnccl.so.2 => not found 2025-05-07T20:10:37.6059404Z libcuda.so.1 => not found 2025-05-07T20:10:37.6059507Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.6059608Z libtorch_cpu.so => not found 2025-05-07T20:10:37.6059735Z libtorch_cuda.so => not found 2025-05-07T20:10:37.6059874Z librt.so.1 => /lib64/librt.so.1 (0x00007f9442d20000) 2025-05-07T20:10:37.6059879Z 2025-05-07T20:10:37.6059991Z [CHECK] Displaying ELF information: 2025-05-07T20:10:37.6060309Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:37.6060347Z 2025-05-07T20:10:37.6060351Z 2025-05-07T20:10:37.6060521Z Dynamic section at offset 0x3a27010 contains 41 entries: 2025-05-07T20:10:37.6060639Z Tag Type Name/Value 2025-05-07T20:10:37.6060852Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 
2025-05-07T20:10:37.6061061Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:37.6061294Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:37.6061489Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:37.6061706Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:37.6061971Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:37.6062186Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:37.6062404Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:37.6062609Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:37.6062815Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:37.6063036Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:37.6063242Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:37.6063453Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:37.6063696Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:37.6063893Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:37.6064109Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:37.6064432Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:10:37.6064633Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:37.6064757Z 0x000000000000000c (INIT) 0x80000 2025-05-07T20:10:37.6064877Z 0x000000000000000d (FINI) 0x261c5c 2025-05-07T20:10:37.6065017Z 0x0000000000000019 (INIT_ARRAY) 0x3a223b0 2025-05-07T20:10:37.6065155Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:10:37.6065287Z 0x000000000000001a (FINI_ARRAY) 0x3a22468 2025-05-07T20:10:37.6065430Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 
2025-05-07T20:10:37.6065552Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:37.6065671Z 0x0000000000000005 (STRTAB) 0xe368 2025-05-07T20:10:37.6065790Z 0x0000000000000006 (SYMTAB) 0x33a0 2025-05-07T20:10:37.6065951Z 0x000000000000000a (STRSZ) 374997 (bytes) 2025-05-07T20:10:37.6066083Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:37.6066204Z 0x0000000000000003 (PLTGOT) 0x3a28fe8 2025-05-07T20:10:37.6066370Z 0x0000000000000002 (PLTRELSZ) 18456 (bytes) 2025-05-07T20:10:37.6066486Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:37.6066606Z 0x0000000000000017 (JMPREL) 0x7b2d8 2025-05-07T20:10:37.6066723Z 0x0000000000000007 (RELA) 0x6ac28 2025-05-07T20:10:37.6066885Z 0x0000000000000008 (RELASZ) 67248 (bytes) 2025-05-07T20:10:37.6067014Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:37.6067135Z 0x000000006ffffffe (VERNEED) 0x6aae8 2025-05-07T20:10:37.6067273Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:37.6067392Z 0x000000006ffffff0 (VERSYM) 0x69c3e 2025-05-07T20:10:37.6067514Z 0x000000006ffffff9 (RELACOUNT) 1392 2025-05-07T20:10:37.6067643Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:37.6067648Z 2025-05-07T20:10:37.6067769Z ################################################################################ 2025-05-07T20:10:37.6067801Z 2025-05-07T20:10:37.6067805Z 2025-05-07T20:10:37.6067925Z ################################################################################ 2025-05-07T20:10:37.6068280Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.6068389Z [CHECK] Listing out library size: 2025-05-07T20:10:37.6068699Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.6068731Z 2025-05-07T20:10:37.6069026Z 328 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.6069031Z 2025-05-07T20:10:37.6069468Z [CHECK] Listing out the GLIBC versions 
referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.6070002Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.6070009Z 2025-05-07T20:10:37.6719529Z GLIBC_2.2.5 2025-05-07T20:10:37.6719816Z GLIBC_2.3 2025-05-07T20:10:37.6720042Z GLIBC_2.14 2025-05-07T20:10:37.6720060Z 2025-05-07T20:10:37.6720073Z 2025-05-07T20:10:37.6721461Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.6723129Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:37.6723420Z 2025-05-07T20:10:37.7373698Z GLIBCXX_3.4 2025-05-07T20:10:37.7373990Z GLIBCXX_3.4.9 2025-05-07T20:10:37.7374233Z GLIBCXX_3.4.11 2025-05-07T20:10:37.7374470Z GLIBCXX_3.4.18 2025-05-07T20:10:37.7374735Z GLIBCXX_3.4.20 2025-05-07T20:10:37.7374972Z GLIBCXX_3.4.21 2025-05-07T20:10:37.7374991Z 2025-05-07T20:10:37.7375304Z 2025-05-07T20:10:37.7394814Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.WLlGk77Ete.symbols.txt 2025-05-07T20:10:37.7394869Z 2025-05-07T20:10:37.8007069Z 2025-05-07T20:10:37.8043704Z [CHECK] Total Number of symbols: 3739 2025-05-07T20:10:37.8100653Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:10:37.8123804Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.lCytyeav9t.usymbols.txt 2025-05-07T20:10:37.8123857Z 2025-05-07T20:10:37.8161979Z 2025-05-07T20:10:37.8195929Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:10:37.8215700Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.8216047Z U VTT 
for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.8216179Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:37.8216422Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.8216590Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:37.8216734Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.8216871Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:37.8216993Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:37.8217163Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:37.8217298Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:37.8217415Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:37.8217521Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:37.8217640Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:37.8217757Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:37.8217866Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:37.8217980Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:37.8218248Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:37.8218373Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:37.8218501Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:37.8218676Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:37.8218836Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:37.8219091Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:37.8219255Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:37.8219836Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.8220470Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.8220683Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:37.8220980Z U at::_ops::to_dtype::call(at::Tensor const&, 
c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:37.8221451Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.8222086Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:37.8222249Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:37.8222441Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:37.8222561Z U at::globalContext() 2025-05-07T20:10:37.8222766Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.8222864Z U c10::BoolType::get() 2025-05-07T20:10:37.8223039Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:37.8223141Z U c10::FloatType::get() 2025-05-07T20:10:37.8223263Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:37.8223452Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.8223596Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:37.8223695Z U c10::IntType::get() 2025-05-07T20:10:37.8223877Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:37.8224002Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:37.8224163Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.8224307Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:37.8224466Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:37.8224637Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:37.8224783Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:37.8225199Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:37.8225338Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:37.8225478Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:37.8225610Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:37.8225744Z U 
c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:37.8225873Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:37.8226044Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:37.8226154Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:37.8226258Z U c10::SymIntType::get() 2025-05-07T20:10:37.8226423Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:37.8226604Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:37.8226705Z U c10::TensorType::get() 2025-05-07T20:10:37.8226845Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:37.8227546Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:37.8227684Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:37.8227821Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:37.8227947Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:37.8228064Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:37.8228201Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:37.8228313Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:37.8228565Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:37.8228714Z U c10::cuda::device_count() 2025-05-07T20:10:37.8228853Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:37.8228986Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:37.8229128Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:37.8229312Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:37.8229473Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:37.8229588Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:37.8230110Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:37.8230362Z U c10::detail::torchCheckFail(char const*, char const*, unsigned 
int, char const*) 2025-05-07T20:10:37.8230864Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.8231201Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:37.8231771Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:37.8231910Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:37.8232023Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:37.8232173Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:37.8232462Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:37.8232584Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:37.8232704Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:37.8232838Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:37.8232981Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:37.8233114Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:37.8233269Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:37.8233384Z U c10::throwNullDataPtrError() 2025-05-07T20:10:37.8233512Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:37.8233624Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:37.8233836Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:37.8233953Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:37.8234082Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:37.8234250Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:37.8234385Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:37.8234496Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:37.8234634Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:37.8234746Z U cudaEventQuery@libcudart.so.12 
2025-05-07T20:10:37.8234859Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:37.8234980Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:37.8235118Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:37.8235252Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:37.8235534Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:37.8235662Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:37.8235775Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:37.8235888Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:37.8236056Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:37.8236177Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:37.8238333Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:37.8238545Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:37.8238662Z U float at::Tensor::item() const 2025-05-07T20:10:37.8238811Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.8239076Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.8239203Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.8239342Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.8239532Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:37.8239660Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:37.8239804Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:37.8239913Z U memcpy@GLIBC_2.14 2025-05-07T20:10:37.8240008Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:37.8240100Z U memset@GLIBC_2.2.5 2025-05-07T20:10:37.8240224Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:37.8240342Z U operator new(unsigned long)@GLIBCXX_3.4 
2025-05-07T20:10:37.8240657Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8240968Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8241270Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8241606Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8241920Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8242224Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:37.8242573Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:37.8242962Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:37.8243275Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:37.8243633Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:37.8243764Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:37.8243877Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:37.8244015Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.8244161Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:37.8244331Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:37.8244485Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:37.8244731Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:37.8245305Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, 
long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.8245429Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:37.8245560Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:37.8245678Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:37.8245790Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:37.8245978Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.8246208Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:37.8246334Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:37.8246450Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:37.8246547Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:37.8246668Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:37.8247240Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:37.8247766Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.8248001Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:37.8248348Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:37.8248850Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:37.8250606Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8252401Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8254508Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8256488Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8258303Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8260145Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:37.8261823Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:37.8261982Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:37.8262145Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:37.8262324Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:37.8262650Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:37.8262901Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:37.8263027Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:37.8263136Z w _ITM_registerTMCloneTable 2025-05-07T20:10:37.8263237Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:37.8263375Z w __gmon_start__ 2025-05-07T20:10:37.8263471Z w __pthread_key_create 2025-05-07T20:10:37.8263583Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:37.8263692Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:37.8263864Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:37.8264117Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.8264126Z 2025-05-07T20:10:37.8292270Z linux-vdso.so.1 (0x00007ffeed1fa000) 2025-05-07T20:10:37.8292651Z libc10.so => not found 2025-05-07T20:10:37.8292926Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8293204Z libc10_cuda.so => not found 2025-05-07T20:10:37.8293465Z libnccl.so.2 => not found 2025-05-07T20:10:37.8293716Z libcuda.so.1 => not found 2025-05-07T20:10:37.8295426Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007ff4bca00000) 2025-05-07T20:10:37.8295722Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8295975Z libtorch.so => not found 2025-05-07T20:10:37.8296686Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8297001Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8297266Z libcudart.so.12 => not found 2025-05-07T20:10:37.8297747Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff4bc79c000) 2025-05-07T20:10:37.8298314Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ff50b77c000) 2025-05-07T20:10:37.8298740Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff50b74e000) 2025-05-07T20:10:37.8299098Z libc.so.6 => /lib64/libc.so.6 (0x00007ff4bc594000) 2025-05-07T20:10:37.8299431Z /lib64/ld-linux-x86-64.so.2 (0x00007ff50b7da000) 2025-05-07T20:10:37.8299519Z libc10.so => not found 2025-05-07T20:10:37.8299613Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8299701Z libc10_cuda.so => not found 
2025-05-07T20:10:37.8299809Z libnccl.so.2 => not found 2025-05-07T20:10:37.8299900Z libcuda.so.1 => not found 2025-05-07T20:10:37.8300361Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007ff4bae00000) 2025-05-07T20:10:37.8300830Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007ff4baa00000) 2025-05-07T20:10:37.8301360Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ff4ba859000) 2025-05-07T20:10:37.8301458Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8301562Z libtorch.so => not found 2025-05-07T20:10:37.8301912Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007ff4ba200000) 2025-05-07T20:10:37.8302358Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff4b9000000) 2025-05-07T20:10:37.8302466Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8302565Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8302658Z libcudart.so.12 => not found 2025-05-07T20:10:37.8302781Z libm.so.6 => /lib64/libm.so.6 (0x00007ff4f6725000) 2025-05-07T20:10:37.8302990Z libtorch.so => not found 2025-05-07T20:10:37.8303070Z libc10.so => not found 2025-05-07T20:10:37.8303155Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8303249Z libc10_cuda.so => not found 2025-05-07T20:10:37.8303337Z libnccl.so.2 => not found 2025-05-07T20:10:37.8303423Z libcuda.so.1 => not found 2025-05-07T20:10:37.8303555Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8303656Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8303746Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8303833Z libcudart.so.12 => not found 2025-05-07T20:10:37.8303929Z libc10.so => not found 2025-05-07T20:10:37.8304016Z libnvrtc.so.12 => 
not found 2025-05-07T20:10:37.8304104Z libc10_cuda.so => not found 2025-05-07T20:10:37.8304188Z libnccl.so.2 => not found 2025-05-07T20:10:37.8304329Z libcuda.so.1 => not found 2025-05-07T20:10:37.8304732Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007ff50b737000) 2025-05-07T20:10:37.8304823Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8304919Z libtorch.so => not found 2025-05-07T20:10:37.8305005Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8305091Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8305178Z libcudart.so.12 => not found 2025-05-07T20:10:37.8305271Z libc10.so => not found 2025-05-07T20:10:37.8305360Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8305444Z libc10_cuda.so => not found 2025-05-07T20:10:37.8305536Z libnccl.so.2 => not found 2025-05-07T20:10:37.8305620Z libcuda.so.1 => not found 2025-05-07T20:10:37.8305710Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8305790Z libtorch.so => not found 2025-05-07T20:10:37.8305891Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8305980Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8306064Z libcudart.so.12 => not found 2025-05-07T20:10:37.8306151Z libc10.so => not found 2025-05-07T20:10:37.8306277Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8306394Z libc10_cuda.so => not found 2025-05-07T20:10:37.8306492Z libnccl.so.2 => not found 2025-05-07T20:10:37.8306620Z libcuda.so.1 => not found 2025-05-07T20:10:37.8306961Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007ff4bc51d000) 2025-05-07T20:10:37.8307097Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8307228Z libtorch.so => not found 2025-05-07T20:10:37.8307333Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8307438Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8307542Z libtorch.so => not found 2025-05-07T20:10:37.8307672Z libc10.so => not found 2025-05-07T20:10:37.8307774Z libnvrtc.so.12 
=> not found 2025-05-07T20:10:37.8307879Z libc10_cuda.so => not found 2025-05-07T20:10:37.8308005Z libnccl.so.2 => not found 2025-05-07T20:10:37.8308106Z libcuda.so.1 => not found 2025-05-07T20:10:37.8308211Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8308315Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8308447Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8308553Z libcudart.so.12 => not found 2025-05-07T20:10:37.8308654Z libtorch.so => not found 2025-05-07T20:10:37.8308784Z libc10.so => not found 2025-05-07T20:10:37.8308886Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8308988Z libc10_cuda.so => not found 2025-05-07T20:10:37.8309088Z libnccl.so.2 => not found 2025-05-07T20:10:37.8309217Z libcuda.so.1 => not found 2025-05-07T20:10:37.8309326Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8309429Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8309563Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8309745Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007ff50b722000) 2025-05-07T20:10:37.8309845Z libtorch.so => not found 2025-05-07T20:10:37.8309942Z libc10.so => not found 2025-05-07T20:10:37.8310074Z libnvrtc.so.12 => not found 2025-05-07T20:10:37.8310176Z libc10_cuda.so => not found 2025-05-07T20:10:37.8310277Z libnccl.so.2 => not found 2025-05-07T20:10:37.8310415Z libcuda.so.1 => not found 2025-05-07T20:10:37.8310522Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:37.8310624Z libtorch_cpu.so => not found 2025-05-07T20:10:37.8310727Z libtorch_cuda.so => not found 2025-05-07T20:10:37.8310901Z librt.so.1 => /lib64/librt.so.1 (0x00007ff50b719000) 2025-05-07T20:10:37.8310922Z 2025-05-07T20:10:37.8311199Z [CHECK] Displaying ELF information: 2025-05-07T20:10:37.8311521Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:37.8311554Z 2025-05-07T20:10:37.8338817Z 2025-05-07T20:10:37.8339505Z Dynamic section at offset 0x147859a8 contains 41 entries: 2025-05-07T20:10:37.8339919Z Tag 
Type Name/Value 2025-05-07T20:10:37.8340531Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:37.8341344Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:37.8341928Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:37.8342537Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:37.8343102Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:37.8343868Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:37.8344475Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:37.8345084Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:37.8345464Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:37.8345673Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:37.8345909Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:37.8346116Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:37.8347676Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:37.8347922Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:37.8348119Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:37.8348371Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:37.8348653Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:10:37.8348841Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:37.8348970Z 0x000000000000000c (INIT) 0x1dc000 2025-05-07T20:10:37.8349092Z 0x000000000000000d (FINI) 0xe754cc 2025-05-07T20:10:37.8349253Z 0x0000000000000019 (INIT_ARRAY) 0x1476a588 2025-05-07T20:10:37.8349394Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:10:37.8349525Z 0x000000000000001a 
(FINI_ARRAY) 0x1476a830 2025-05-07T20:10:37.8349686Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:37.8349814Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:37.8349936Z 0x0000000000000005 (STRTAB) 0x1c8a0 2025-05-07T20:10:37.8350082Z 0x0000000000000006 (SYMTAB) 0x6a00 2025-05-07T20:10:37.8350233Z 0x000000000000000a (STRSZ) 1486798 (bytes) 2025-05-07T20:10:37.8350367Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:37.8350500Z 0x0000000000000003 (PLTGOT) 0x1478afe8 2025-05-07T20:10:37.8350672Z 0x0000000000000002 (PLTRELSZ) 22152 (bytes) 2025-05-07T20:10:37.8350779Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:37.8350905Z 0x0000000000000017 (JMPREL) 0x1d5988 2025-05-07T20:10:37.8351056Z 0x0000000000000007 (RELA) 0x1896c8 2025-05-07T20:10:37.8351203Z 0x0000000000000008 (RELASZ) 312000 (bytes) 2025-05-07T20:10:37.8351335Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:37.8351500Z 0x000000006ffffffe (VERNEED) 0x1895a8 2025-05-07T20:10:37.8351623Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:37.8351752Z 0x000000006ffffff0 (VERSYM) 0x18786e 2025-05-07T20:10:37.8351873Z 0x000000006ffffff9 (RELACOUNT) 8035 2025-05-07T20:10:37.8352015Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:37.8352031Z 2025-05-07T20:10:37.8352162Z ################################################################################ 2025-05-07T20:10:37.8352199Z 2025-05-07T20:10:37.8352204Z 2025-05-07T20:10:37.8352418Z [CHECK] Verifying sample subset of symbols in the built libraries ... 
2025-05-07T20:10:37.8469641Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8497440Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8741835Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8785419Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8846918Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8890161Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8928779Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.8959174Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:37.9084665Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9115469Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9355959Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9401713Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9458295Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9499564Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9535652Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9569644Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:37.9978571Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.0358894Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.0580707Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.1529349Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.1569172Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.1657916Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.1990365Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:38.1992453Z ################################################################################ 2025-05-07T20:10:38.1993986Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:38.1995436Z 
2025-05-07T20:10:38.1996297Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:38.1996879Z 2025-05-07T20:10:49.6899851Z 2025-05-07T20:10:49.6901348Z fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl is 2025-05-07T20:10:49.6903250Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:10:49.6904073Z 2025-05-07T20:10:49.6904554Z The wheel references external versioned symbols in these 2025-05-07T20:10:49.6905824Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:10:49.6907031Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0'}, 2025-05-07T20:10:49.6907645Z libstdc++.so.6 with versions {'GLIBCXX_3.4.20', 'GLIBCXX_3.4', 2025-05-07T20:10:49.6908203Z 'GLIBCXX_3.4.14', 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.18', 'CXXABI_1.3.7', 2025-05-07T20:10:49.6908646Z 'CXXABI_1.3.3', 'CXXABI_1.3.5', 'GLIBCXX_3.4.11', 'CXXABI_1.3.11', 2025-05-07T20:10:49.6909087Z 'GLIBCXX_3.4.9', 'GLIBCXX_3.4.19', 'CXXABI_1.3', 'GLIBCXX_3.4.21'}, 2025-05-07T20:10:49.6909511Z libc.so.6 with versions {'GLIBC_2.2.5', 'GLIBC_2.3', 'GLIBC_2.14', 2025-05-07T20:10:49.6909950Z 'GLIBC_2.3.3', 'GLIBC_2.7', 'GLIBC_2.3.2', 'GLIBC_2.6', 'GLIBC_2.17'}, 2025-05-07T20:10:49.6910369Z libpthread.so.0 with versions {'GLIBC_2.3.4', 'GLIBC_2.2.5', 2025-05-07T20:10:49.6910780Z 'GLIBC_2.3.2'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:10:49.6911299Z libcudart.so.12 with versions {'libcudart.so.12'}, libgomp.so.1 with 2025-05-07T20:10:49.6911760Z versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.3.4', 2025-05-07T20:10:49.6912115Z 'GLIBC_2.2.5'} 2025-05-07T20:10:49.6912236Z 2025-05-07T20:10:49.6912430Z This constrains the platform tag to "manylinux_2_27_x86_64". 
In order 2025-05-07T20:10:49.6912977Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:10:49.6913417Z wheel from source on a system with earlier versions of these 2025-05-07T20:10:49.6913809Z libraries, such as a recent manylinux image. 2025-05-07T20:10:49.7695738Z 2025-05-07T20:10:49.7695955Z 2025-05-07T20:10:49.7696615Z ################################################################################ 2025-05-07T20:10:49.7697039Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:10:49.7710494Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:49.7710994Z 2025-05-07T20:10:49.7715805Z -rw-r--r--. 1 root root 505M May 7 20:10 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:49.7716523Z 2025-05-07T20:10:49.7716697Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:10:49.7717406Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:49.7717960Z 2025-05-07T20:10:50.7203133Z 299df2de429e3f992bd4ca97e3ecaf8eca88673f dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.7204704Z 2025-05-07T20:10:50.7205484Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.7206553Z 2025-05-07T20:10:52.9235060Z 39c1d3cc0e0b813059420b1119a422c86410a97e86df1b3e3a211855f053b85b dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:52.9235990Z 2025-05-07T20:10:52.9236701Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:52.9237161Z 2025-05-07T20:10:53.7693194Z be5559aec3f2b56feda8dda237eb993d dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:10:53.7694644Z 2025-05-07T20:10:53.7695010Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:10:53.7795765Z ##[group]Run actions/upload-artifact@v4 
2025-05-07T20:10:53.7796110Z with: 2025-05-07T20:10:53.7796435Z name: fbgemm_default_x86_clang_py3.13_cu12.6.3.whl 2025-05-07T20:10:53.7796785Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:10:53.7797098Z if-no-files-found: error 2025-05-07T20:10:53.7797465Z compression-level: 6 2025-05-07T20:10:53.7797778Z overwrite: false 2025-05-07T20:10:53.7798055Z include-hidden-files: false 2025-05-07T20:10:53.7798319Z env: 2025-05-07T20:10:53.7798576Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:10:53.7798914Z BUILD_ENV: build_binary 2025-05-07T20:10:53.7799189Z BUILD_TARGET: default 2025-05-07T20:10:53.7799527Z BUILD_VARIANT: cuda 2025-05-07T20:10:53.7799802Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:10:53.7800065Z ##[endgroup] 2025-05-07T20:10:53.7803451Z ##[command]/usr/bin/docker exec 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:10:54.2586970Z With the provided path, there will be 1 file uploaded 2025-05-07T20:10:54.2590809Z Artifact name is valid! 2025-05-07T20:10:54.2591845Z Root directory input is valid! 
2025-05-07T20:10:54.3622640Z Beginning upload of artifact content to blob storage 2025-05-07T20:10:55.1034515Z Uploaded bytes 8388608 2025-05-07T20:10:55.5307710Z Uploaded bytes 16777216 2025-05-07T20:10:55.9666201Z Uploaded bytes 25165824 2025-05-07T20:10:56.3878279Z Uploaded bytes 33554432 2025-05-07T20:10:56.7907066Z Uploaded bytes 41943040 2025-05-07T20:10:57.2854684Z Uploaded bytes 50331648 2025-05-07T20:10:57.6843509Z Uploaded bytes 58720256 2025-05-07T20:10:58.1778234Z Uploaded bytes 67108864 2025-05-07T20:10:58.5366361Z Uploaded bytes 75497472 2025-05-07T20:10:59.0803023Z Uploaded bytes 83886080 2025-05-07T20:10:59.4576995Z Uploaded bytes 92274688 2025-05-07T20:10:59.9079539Z Uploaded bytes 100663296 2025-05-07T20:11:00.3514116Z Uploaded bytes 109051904 2025-05-07T20:11:00.8111964Z Uploaded bytes 117440512 2025-05-07T20:11:01.1558928Z Uploaded bytes 125829120 2025-05-07T20:11:01.6656839Z Uploaded bytes 134217728 2025-05-07T20:11:02.0877637Z Uploaded bytes 142606336 2025-05-07T20:11:02.5125674Z Uploaded bytes 150994944 2025-05-07T20:11:02.9467391Z Uploaded bytes 159383552 2025-05-07T20:11:03.4877180Z Uploaded bytes 167772160 2025-05-07T20:11:03.8802764Z Uploaded bytes 176160768 2025-05-07T20:11:04.3462333Z Uploaded bytes 184549376 2025-05-07T20:11:04.7578666Z Uploaded bytes 192937984 2025-05-07T20:11:05.2838365Z Uploaded bytes 201326592 2025-05-07T20:11:05.6389729Z Uploaded bytes 209715200 2025-05-07T20:11:06.1065033Z Uploaded bytes 218103808 2025-05-07T20:11:06.5582741Z Uploaded bytes 226492416 2025-05-07T20:11:06.9907017Z Uploaded bytes 234881024 2025-05-07T20:11:07.4674437Z Uploaded bytes 243269632 2025-05-07T20:11:07.7578106Z Uploaded bytes 251658240 2025-05-07T20:11:08.1832036Z Uploaded bytes 260046848 2025-05-07T20:11:08.6086616Z Uploaded bytes 268435456 2025-05-07T20:11:09.0243497Z Uploaded bytes 276824064 2025-05-07T20:11:09.3887678Z Uploaded bytes 285212672 2025-05-07T20:11:09.8901559Z Uploaded bytes 293601280 2025-05-07T20:11:10.3111545Z Uploaded 
bytes 301989888 2025-05-07T20:11:10.7492241Z Uploaded bytes 310378496 2025-05-07T20:11:11.2357402Z Uploaded bytes 318767104 2025-05-07T20:11:11.6945925Z Uploaded bytes 327155712 2025-05-07T20:11:12.0734144Z Uploaded bytes 335544320 2025-05-07T20:11:12.5236441Z Uploaded bytes 343932928 2025-05-07T20:11:12.9608472Z Uploaded bytes 352321536 2025-05-07T20:11:13.3337245Z Uploaded bytes 360710144 2025-05-07T20:11:13.8278593Z Uploaded bytes 369098752 2025-05-07T20:11:14.2381085Z Uploaded bytes 377487360 2025-05-07T20:11:14.6820376Z Uploaded bytes 385875968 2025-05-07T20:11:15.1459453Z Uploaded bytes 394264576 2025-05-07T20:11:15.5935607Z Uploaded bytes 402653184 2025-05-07T20:11:16.0173365Z Uploaded bytes 411041792 2025-05-07T20:11:16.4412965Z Uploaded bytes 419430400 2025-05-07T20:11:16.8382467Z Uploaded bytes 427819008 2025-05-07T20:11:17.2760023Z Uploaded bytes 436207616 2025-05-07T20:11:17.7825181Z Uploaded bytes 444596224 2025-05-07T20:11:18.1624385Z Uploaded bytes 452984832 2025-05-07T20:11:18.5994869Z Uploaded bytes 461373440 2025-05-07T20:11:19.0049488Z Uploaded bytes 469762048 2025-05-07T20:11:19.4503586Z Uploaded bytes 478150656 2025-05-07T20:11:19.8327511Z Uploaded bytes 486539264 2025-05-07T20:11:20.2143504Z Uploaded bytes 494927872 2025-05-07T20:11:20.6887928Z Uploaded bytes 503316480 2025-05-07T20:11:21.0901311Z Uploaded bytes 511705088 2025-05-07T20:11:21.4377864Z Uploaded bytes 518342721 2025-05-07T20:11:21.4539397Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:21.4541525Z SHA256 digest of uploaded artifact zip is 89eaaaecfb7193ffac918299fa53a504232da76b2cfdf030f646c3d051d08d9d 2025-05-07T20:11:21.4543368Z Finalizing artifact upload 2025-05-07T20:11:21.5487718Z Artifact fbgemm_default_x86_clang_py3.13_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081456163 2025-05-07T20:11:21.5488663Z Artifact fbgemm_default_x86_clang_py3.13_cu12.6.3.whl has been successfully uploaded! Final size is 518342721 bytes. 
Artifact ID is 3081456163 2025-05-07T20:11:21.5498404Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081456163 2025-05-07T20:11:21.5734428Z Post job cleanup. 2025-05-07T20:11:21.5739774Z ##[command]/usr/bin/docker exec 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:21.8570521Z [command]/usr/bin/git version 2025-05-07T20:11:21.8606114Z git version 2.47.1 2025-05-07T20:11:21.8636960Z Copying '/github/home/.gitconfig' to '/__w/_temp/893f4ec1-baad-4e99-9e06-39ae06b55b23/.gitconfig' 2025-05-07T20:11:21.8645709Z Temporarily overriding HOME='/__w/_temp/893f4ec1-baad-4e99-9e06-39ae06b55b23' before making global git config changes 2025-05-07T20:11:21.8646490Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:21.8650442Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:21.8694554Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:21.8721566Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:21.9006541Z Entering 'external/asmjit' 2025-05-07T20:11:21.9055037Z Entering 'external/composable_kernel' 2025-05-07T20:11:21.9112974Z Entering 'external/cpuinfo' 2025-05-07T20:11:21.9182318Z Entering 'external/cutlass' 2025-05-07T20:11:21.9252479Z Entering 'external/googletest' 2025-05-07T20:11:21.9321835Z Entering 'external/hipify_torch' 2025-05-07T20:11:21.9371782Z Entering 'external/json' 2025-05-07T20:11:21.9432209Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:21.9452456Z http.https://github.com/.extraheader 2025-05-07T20:11:21.9457467Z [command]/usr/bin/git config --local --unset-all 
http.https://github.com/.extraheader 2025-05-07T20:11:21.9481245Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:21.9786969Z Entering 'external/asmjit' 2025-05-07T20:11:21.9822856Z http.https://github.com/.extraheader 2025-05-07T20:11:21.9859580Z Entering 'external/composable_kernel' 2025-05-07T20:11:21.9898980Z http.https://github.com/.extraheader 2025-05-07T20:11:21.9946791Z Entering 'external/cpuinfo' 2025-05-07T20:11:21.9995520Z http.https://github.com/.extraheader 2025-05-07T20:11:22.0036966Z Entering 'external/cutlass' 2025-05-07T20:11:22.0073012Z http.https://github.com/.extraheader 2025-05-07T20:11:22.0119718Z Entering 'external/googletest' 2025-05-07T20:11:22.0157324Z http.https://github.com/.extraheader 2025-05-07T20:11:22.0196337Z Entering 'external/hipify_torch' 2025-05-07T20:11:22.0232690Z http.https://github.com/.extraheader 2025-05-07T20:11:22.0273391Z Entering 'external/json' 2025-05-07T20:11:22.0309174Z http.https://github.com/.extraheader 2025-05-07T20:11:22.0498065Z Stop and remove container: 31897149cfcd45e99da7b09c84542214_amazonlinux2023_3f340f 2025-05-07T20:11:22.0503319Z ##[command]/usr/bin/docker rm --force 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T20:11:23.4322218Z 2b31f69c500b43ab9782f5664ba09da9843f0f5b972ff047034471321a834ace 2025-05-07T20:11:23.4350489Z Remove container network: github_network_9484ef32e44d40e598a92c2c5c95b912 2025-05-07T20:11:23.4354647Z ##[command]/usr/bin/docker network rm github_network_9484ef32e44d40e598a92c2c5c95b912 2025-05-07T20:11:24.4119423Z github_network_9484ef32e44d40e598a92c2c5c95b912 2025-05-07T20:11:24.4154762Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:24.4173725Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 
2025-05-07T20:11:24.4179702Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:24.4180119Z ##[endgroup] 2025-05-07T20:11:24.4298149Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:11:34.5430680Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:11:50.7196169Z Cleaning up orphan processes